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Introduction 


Genomic  instability  is  a  hallmark  of  cancer.  Central  to  a  cells  ability  to  maintain  genomic  stability  are 
systems  that  monitor  and  repair  DNA  double  strand  breaks  (DSBs).  The  objective  of  this  study  is  to 
understand  how  the  choice  of  pathways  used  to  repair  DNA  damage  determines  whether  the  repair  is  error 
free  or  causes  genomic  instability.  In  mammalian  cells,  homologous  recombination  (HR)  and 
nonhomologous  end  joining  (NHEJ)  are  the  two  major  pathways  involved  in  the  repair  of  DNA  DSBs. 

The  Brcal  gene  is  required  for  DNA  repair  by  homologous  recombination  and  normal  embryonic 
development.  Additionally,  protein  53BP1  promotes  ligation  and  facilitates  end  joining.  In  previous 
studies,  we  have  demonstrated  that  Brcal  and  53BP1  can  compete  for  the  processing  of  DSBs  and  that 
53BP1  can  promote  genomic  instability  in  the  absence  of  Brcal.  Thus,  the  tumor  suppressive  function  of 
Brcal  does  not  appear  to  be  absolute  and  can  be  modulated  by  altering  the  ability  of  cells  to  carryout 
NHEJ.  Our  study  focuses  on  shifting  the  balance  between  these  two  repair  pathways  (HR  and  NHEJ)  to 
restore  error  free  repair  and  genomic  stability.  We  believe  that  a  better  understanding  of  mechanisms  of 
DSB  repair  pathway  choice  may  have  important  therapeutic  implications  for  prevention  or  treatment  of 
Brcal/2  germline  mutation-associated  cancers. 

Body 

Aim  1:  Determine  the  capacity  of  NHEJ  deficiency  to  rescue  defects  in  homologous  recombination  (HR). 
Using  various  established  mouse  models  where  there  is  a  clearly  described  defect  in  HR,  we  will  test  the 
role  of  the  NHEJ  proteins  53BP1  and  Ku  in  subverting  HR. 

We  have  discovered  a  new  NHEJ  factor  called  RIF1  that  acts  downstream  of  53BP1  in  blocking 
resection.  This  work  was  recently  published  (M.  Di  Virgilio  et  al.  Science  2013).  The  relevant  data  is 
shown  in  Figure  4  of  this  publication  as  follows,  which  demonstrates  increase  RPA  association  at  the  IgH 
locus: 


A  RPA 
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We  are  currently  crossing  mice  containing  floxed  alleles  of  Brea  1  with  RIF1""  mice  to  generate 
Brcalf/fRIFl+/+  and  Brcalf/fRIFr/_  mice.  We  propose  to  test  these  cells  for  HR  reconstitution  by 
assaying  for  the  formation  of  chromosomal  breaks  and  radial  structures,  Rad51  foci  formation, 
RPA  phosphorylation  and  PARPi  sensitivity. 

The  proposed  PALB2fl/fl  x  53BP1  crosses  have  also  been  generated  and  characterized.  This  work 
was  recently  accepted  for  publication  (C.  Bowman-Colin  et  al.  PNAS  2013).  The  relevant  data  is 
shown  in  Figure  4  of  this  publication  as  follows: 


-PAR  Pi  frPARPt 


— I - 1 I - 1 1 - ■ -  V  '  ■ 

WT  Palb2  ■'  ;  Pafb2F  WT  Palb2 '  ;  PaEb2'  ; 

Trp53bp1+/-  Trp53bp1^‘  Trp53bpt^  Trp53bp1  r* 


This  demonstrates  that  unlike  BRCAL  loss  of  53BP1  (Trp53BPl)  does  not  rescue  the  genome 
instability  associated  with  deletion  of  PALB2. 


Aim  2:  Determine  the  domain  of  53BP1  that  inhibits  HR  in  Brea  1 -deficient  mice.  We  will  use  a 
combined  in  vitro  and  in  vivo  reconstitution  approach  to  define  the  functional  domains  of  53BP1  that 
regulate  the  observed  HR  defects  seen  in  Brea  1 -deficient  cells. 

We  are  using  retroviral  gene  transfer  to  reconstitute  53BP1  mutants  into  BrcalA11/  A1153BPr/_  B  cells. 
Given  that  BrcalA11/A1153BPr  "  B  cells  are  resistant  to  PARP  inhibitors,  we  are  using  restoration  of  PARP 
sensitivity  as  an  initial  screen  to  determine  the  relevant  domain  of  53BP1  required  for  inhibiting  HR 
activity.  Simultaneously  we  are  examining  the  domains  required  for  class  switching.  Retroviral  constructs 
include  full-length  53BP1,  53BP1EV  (empty  vector),  53BP1db  and  53BP18A  We  show  that  a  53BP1 
phospho-mutant  53BP18A,  comprising  alanine  substitutions  of  the  8  most  N-terminal  S/TQ 
phosphorylation  sites,  mimics  53BP1  deficiency  by  restoring  genome  stability  in  BRCA1  deficient  cells 
yet  behaves  like  wild-type  53BP1  with  respect  to  immunoglobulin  class  switch  recombination  (CSR):  The 
relevant  data  is  shown  below,  and  we  have  recently  submitted  a  paper  based  on  these  findings: 
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These  results  indicate  53BP18A  behaves  like  a  53BP1  null  with  respect  to  rescuing  BRCA1 
deficiency,  but  is  WT  for  class  switching. 


In  the  context  of  this  proposal,  we  have  been  interested  to  know  whether  there  are  preferred  “genomic 
sites”  for  genome  instability  during  DNA  replication.  Recently  we  discovered  a  novel  type  of  “fragile” 
site  that  contributes  to  genome  instability.  These  sites  (coined  “ERFS”  are  broken  spontaneously  during 
replication,  and  their  fragility  is  increased  by  hydroxyurea,  ATR  inhibition,  deregulated  c-Myc  expression 
and  by  PARPi  treatment  of  BRCA1  deficient  cells.  This  work  was  recently  published  (J.  Barlow  et  al. 
Cell;  152:620-632,  2013).  The  relevant  data  is  shown  in  figure  3  of  this  manuscript,  which  shows 
examples  and  quantitation  of  ERFS  breakage  in  response  to  HU: 
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Aim  3:  Develop  small  molecule  inhibitors  of  53BP1  as  possible  lead  compounds  to  inhibit  Brea- 
mediated  tumor  formation.  This  highly  ambitious  project  is  ongoing. 

Task  1:  Construct  a  GFP-53BP1  expression  vector  containing  the  minimal  foci  forming  domain 
of  53BP1  and  create  stable  cell  line  that  has  robust  inducible  foci  formation  following  DNA 
damage  (adriamycin  treatment). 

We  have  determined  conditions  for  measuring  53BP1  foci  using  an  alternative 
immunofluorescence  approach  that  gives  robust  foci  following  treatment  with  the  radiomimetic 
drug,  neocarzinostatin  (NCS).  In  collaboration  with  Dr.  Ty  Voss  of  the  High-Throughput 
Microscopy  Core  at  NIH,  we  have  been  able  to  visualize  the  foci  using  the  automated  Perkin- 
Elmer  Opera  platform  that  will  enable  screening  of  the  NCI  Diversity  Set  by  measuring  the 
suppression  of  the  appearance  of  53BP1  foci  by  any  potential  lead  compound.  Currently  we  are 
optimizing  this  system  with  appropriate  positive  and  negative  controls.  This  work  will  enable  us 
to  conduct  a  screen  of  the  NIH  Diversity  Set  of  small  molecules,  thereby  enabling  us  to  carry  out 
the  more  detailed  screening  approaches  entailed  in  Specific  Aim  3 . 


Key  Research  Accomplishments 

•  Deletion  of  the  DNA  damage  response  gene,  RIF1,  mimics  53BP1  deficiency  with  respect  to 
increased  resection  and  defective  class  switching. 

•  Genomic  instability  and  hypersensitivity  of  BRC A 1 -deficient  cells  to  inhibitors  of  polyADP- 
ribose  polymerase  (PARP  inhibitors,  PARPi)  is  not  suppressed  by  deletion  of  PALB2. 

•  Phosphorylation  of  the  N-terminus  of  53BP1  is  essential  for  the  role  of  53BP1  in  promoting 
genomic  instability  in  BRCA1 -deficient  cells  but  is  dispensable  for  class  switching. 
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•  PARPi  treatment  leads  to  genome  instability  at  preferred  genomic  sites. 

Reportable  Outcomes 

1 .  Three  high-impact  papers  have  been  published  or  are  in  press  based  on  this  work  with  the 
awardee,  Dr.  Andre  Nussenzweig  (M.  Di  Virgilio  et  al.  Science  2013.  C.  Bowman-Colin  et  al. 
PNAS  2013,  J.  Barlow  et  al.  Cell;  152:620-632,  2013).  One  additional  paper  has  been  submitted 
for  publication. 


Conclusion 


This  work  has  revealed  for  the  first  time  that  deletion  of  53BP1  prevents  genomic  instability  in  cells 
lacking  the  tumor  suppressor,  BRCA1.  53BP1  therefore  plays  a  key  role  in  cellular  changes  leading  to 
cancer  in  individuals  with  BRCA1  mutations.  This  validates  the  targeting  of  53BP1  as  a  chemopreventive 
measure  to  avert  the  appearance  of  cancer  in  women  with  mutations  in  BRCA1,  which  accounts  for  ~5% 
of  all  annual  cases  of  breast  cancer  and  a  higher  proportion  of  ovarian  cancers.  This  work  further 
suggests  that  53BP1  inactivation  or  deregulation  of  downstream  pathways  (eg.  RIF1)  is  a  potential 
mechanism  leading  to  resistance  of  BRC A 1 -deficient  tumors  to  chemotherapeutic  regimens  involving 
PARP  inhibitors. 
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higher  numbers  from  colon  contents  than  was 
the  nitrate  respiration-deficient  mutant  (Fig.  3H 
and  fig.  S8B).  Collectively,  these  data  suggested 
that  nitrate  respiration  conferred  a  marked  growth 
advantage  on  commensal  E.  coli  in  the  lumen 
of  the  inflamed  gut. 

The  picture  emerging  from  this  study  is  that 
nitrate  generated  as  a  by-product  of  the  host  in¬ 
flammatory  response  can  be  used  by  E.  coli ,  and 
likely  by  other  commensal  Enterobacteriaceae, 
to  edge  out  competing  microbes  that  rely  on  fer¬ 
mentation  to  generate  energy  for  growth.  Obli¬ 
gate  anaerobic  microbes  in  the  intestine  compete 
for  nutrients  that  are  available  for  fermentation 
but  cannot  use  nonfermentable  nutrients  (such 
as  fermentation  end  products).  The  ability  to 
degrade  nonfermentable  substrates  probably 
enables  E.  coli  to  sidestep  this  competition,  which 
explains  the  fitness  advantage  conferred  by  ni¬ 
trate  respiration  in  the  inflamed  gut.  Through 
this  mechanism,  inflammation  contributes  to  a 
bloom  of  nitrate-respiration-proficient  Entero¬ 
bacteriaceae,  providing  a  plausible  explanation 
for  the  dysbiosis  associated  with  intestinal  in¬ 
flammation  {3-12).  This  general  principle  might 
also  influence  the  dynamics  of  host-associated 


bacterial  communities  outside  the  large  bowel, 
as  nitrate  respiration  confers  a  fitness  advantage 
in  the  oxygen-poor  and  nitrate-rich  environment 
of  the  cystic  fibrosis  airway  {21). 
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Rifl  Prevents  Resection  of 
DNA  Breaks  and  Promotes 
Immunoglobulin  Class  Switching 

Michela  Di  Virgilio,1  Elsa  Callen,3*  Arito  Yamane,4*  Wenzhu  Zhang,5*  Mila  Jankovic,1 
Alexander  D.  Gitlin,1  Niklas  Feldhahn,1  Wolfgang  Resch,4  Thiago  Y.  Oliveira,1,6'7  Brian  T.  Chait,5 
Andre  Nussenzweig,3  Rafael  Casellas,4  Davide  F.  Robbiani,1  Michel  C.  Nussenzweig1,2t 

DNA  double-strand  breaks  (DSBs)  represent  a  threat  to  the  genome  because  they  can  lead  to 
the  loss  of  genetic  information  and  chromosome  rearrangements.  The  DNA  repair  protein 
p53  binding  protein  1  (53BP1)  protects  the  genome  by  limiting  nucleolytic  processing  of  DSBs 
by  a  mechanism  that  requires  its  phosphorylation,  but  whether  53BP1  does  so  directly  is 
not  known.  Here,  we  identify  Rapl-interacting  factor  1  (Rifl)  as  an  ATM  (ataxia-telangiectasia 
mutated)  phosphorylation-dependent  interactor  of  53BP1  and  show  that  absence  of  Rifl  results 
in  5'-3'  DNA-end  resection  in  mice.  Consistent  with  enhanced  DNA  resection,  Rifl  deficiency 
impairs  DNA  repair  in  the  Gi  and  S  phases  of  the  cell  cycle,  interferes  with  class  switch 
recombination  in  B  lymphocytes,  and  leads  to  accumulation  of  chromosome  DSBs. 


The  DNA  damage  response  factor  p53  bind¬ 
ing  protein  1  (53BP1)  is  a  multidomain  pro¬ 
tein  containing  a  chromatin-binding  tudor 
domain,  an  oligomerization  domain,  tandem  breast 
cancer  1  (BRCA1)  C-terminal  (BRCT)  domains, 
and  an  N-terminal  domain  with  28  SQ/TQ  poten¬ 
tial  phosphorylation  sites  for  phosphatidylinositol 
3-kinase-related  kinases  [PIKKs,  ataxia-telangiectasia 
mutated  (ATM)/ATM  and  Rad3-related/DNA- 
dependent  protein  kinase  catalytic  subunit  (DNA- 
PKcs)]  {1-3).  53BP1  contributes  to  DNA  repair 
in  several  ways:  This  protein  facilitates  joining 
between  intrachromosomal  double-strand  breaks 
(DSBs)  at  a  distance  (synapsis)  {4-7),  it  enables 
heterochromatic  DNA  repair  through  relaxa¬ 


tion  of  nucleosome  compaction  (2,  3),  and  it 
protects  DNA  ends  from  resection  and  thereby 
favors  repair  of  DSBs  that  occur  in  Gi  phase  by 
nonhomologous  end  joining  (NHEJ)  {4,  5,  8). 
Consistent  with  its  role  in  DNA-end  protection, 
53BP1  is  essential  for  class  switch  recombina¬ 
tion  (CSR)  in  B  lymphocytes  (9,  10). 

Structure-function  studies  indicate  that,  be¬ 
sides  the  recruitment  of  53BP1  to  DNA  ends, 
protection  requires  53BP1  phosphorylation  {4), 
but  how  this  protective  effect  is  mediated  is  un¬ 
known.  To  identify  phosphorylation-dependent 
interactors  of  53BP1,  we  applied  stable  isotope 
labeling  by  amino  acids  in  cell  culture  (SILAC). 
Trp53bpl~ A  {Trp53bpl  encodes  53BP1)  B  cells  were 


infected  with  retroviruses  encoding  a  C-terminal 
deleted  version  of  53BP1  (53BP1db)  or  a  phospho- 
mutant  in  which  all  28  N-terminal  potential  PUCK 
phosphorylation  sites  were  mutated  to  alanine 
(53BP1DB28A)  {4),  in  media  containing  isotopically 
heavy  (53BP1db)  or  light  (53BP1DB28A)  lysine  and 
arginine  (fig.  SI,  A  to  C)  {11). 

Most  proteins  coprecipitating  with  53BP1db 
and  53BP1DB28A  displayed  a  H/{H+  L)  ratio  of 
~0.5  {H,  heavy;  L,  light),  which  is  character¬ 
istic  of  phospho-independent  association  (av¬ 
erage  of  0.57  ±  0.09,  peptide  count:  at  least  four) 
(Fig.  1  and  table  SI).  Many  of  these  proteins 
are  nonspecific  contaminants,  but  others  such 
as  KRAB-associated  protein  1  (KAP-1),  dynein 
light  chain  LC8-type  1  (Dynlll),  Nijmegen  break¬ 
age  syndrome  1  (Nbsl),  and  H2AX  represent  au¬ 
thentic  phospho-independent  5 3 BP  1 -interacting 
proteins  (fig.  SID).  Three  proteins  displayed 
an  abundance  ratio  that  was  more  than  four 
standard  deviations  (SDs)  above  the  mean, 
indicating  that  these  proteins  interact  specifically 
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with  phosphorylated  53BP1:  Pax  interaction 
with  transcription-activation  domain  protein- 1 
(Paxipl,  or  PTIP;  0.95),  PTIP-associated  protein 
1  (Pal;  0.97),  and  Rap  1 -interacting  factor  1  (Rifl) 
(0.96)  (Fig.  1  and  figs.  SID  and  S2).  PTIP  was 
known  to  interact  with  53BP1  in  a  phospho- 
dependent  manner  (72),  whereas  Pal  and  Rifl 
were  not. 

Rifl  was  originally  identified  in  budding 
yeast  as  a  protein  with  a  key  role  in  telomere 
length  maintenance  (73).  However,  in  mam¬ 
malian  cells,  Rifl  is  not  essential  for  telomere 
homeostasis,  but  has  been  assigned  a  number  of 
different  roles  in  maintaining  genome  stability, 
including  participation  in  the  DNA  damage  re¬ 
sponse  (14-16),  repair  of  S-phase  DNA  damage 
(77,  18),  and  regulation  of  origin  firing  during 
DNA  replication  (79,  20).  However,  the  mech¬ 
anism  by  which  Rifl  might  contribute  to  DNA 
repair  and  maintenance  of  genome  stability  is  not 
known. 

To  confirm  that  Rifl  interaction  with  53BP1 
is  dependent  on  phosphorylation,  we  performed 
Western  blot  analysis  of  Flag  immunoprecipi- 
tates  from  lysates  of  irradiated  Trp53bpF 7  B 
cells  infected  with  retroviruses  encoding  53BP1db 
or  53BP1DB28A.  Whereas  Dynlll,  a  phospho- 
independent  53BP1  interactor  (SILAC  ratio:  0.55) 
(fig.  SID),  coimmunoprecipitated  with  53BP1db 
and  53BP1DB28A  to  a  similar  extent  (Fig.  2A), 
only  53BP1db  coimmunoprecipitated  with  Rifl. 
We  conclude  that  the  interaction  between  53BP1 
and  Rifl  is  dependent  on  phosphorylation  of 
53BP1. 

Ataxia-telangiectasia  mutated  phosphorylates 
53BP1  in  response  to  DSBs  (7,  3).  To  determine 
whether  ATM  induces  DNA  damage-dependent 
association  between  Rifl  and  53BP1,  we  com¬ 
pared  irradiated  and  nonirradiated  B  cells  in 
coimmunoprecipitation  experiments.  Although  we 
detected  small  amounts  of  Rifl  in  53BP1db  im- 
munoprecipitates  from  unirradiated  cells,  this 
was  increased  by  a  factor  of  >3  after  irradiation, 
and  the  increase  was  abrogated  by  treatment 
with  the  ATM  inhibitor  KU55933  (Fig.  2B).  We 
conclude  that  Rifl  preferentially  interacts  with 
phosphorylated  53BP1  in  a  DNA  damage-  and 
ATM-dependent  manner. 

Rifl  is  recruited  to  DNA  damage  foci  by 
53BP1  (75).  To  determine  whether  53BP1  phos¬ 
phorylation  is  required  for  Rifl  focus  formation, 
we  tested  Rifl  foci  in  irradiated  Trp53bpl~ /_  im¬ 
mortalized  mouse  embryonic  fibroblasts  (iMEFs), 
which  were  stably  transduced  with  either  53BP1db 
or  53BP1DB28A.  Rifl  foci  were  readily  detected  and 
colocalized  with  53BP1db  (Fig.  2C).  In  contrast, 
although  53BP1DB28A  formed  normal-appearing 
foci,  Rifl  foci  were  rare  and  did  not  colocalize 
with  53BP1  (Fig.  2C).  Furthermore,  Rifl  recruit¬ 
ment  to  ionizing  radiation-induced  foci  (IRIF) 
and  colocalization  with  53BP1  were  abrogated 
in  ATM-deficient  but  not  DNA-PKcs-deficient 
iMEFs  (fig.  S3)  (75).  We  conclude  that  Rifl  re¬ 
cruitment  to  DNA  damage  response  foci  is  de¬ 
pendent  on  AIM-mediated  53BP1  phosphorylation. 


The  phosphorylation  of  53BP1  is  essential  for 
CSR  (4).  To  examine  the  role  of  Rifl  in  joining 
DSBs  during  CSR,  we  conditionally  ablated 
Rifl  in  B  cells  using  CD19Cre,  which  is  ex¬ 
pressed  specifically  in  B  cells  (R ifl  F/FCd  1 9Cre/+ 
mice)  (fig.  S4,  A  to  C).  To  induce  CSR,  B  cells 
were  activated  with  lipopolysaccharide  (LPS) 
and  interleukin-4  (IL-4)  in  vitro,  and  switching 
to  immunoglobulin  G1  (IgGl)  or  IgG3  was  mea¬ 


sured  by  flow  cytometry.  CSR  to  IgGl  and  IgG3 
was  markedly  reduced  in  RiflF/FCdl9Cre/+  B 
cells,  but  less  so  than  in  Trp53bpl~/  controls 
(Fig.  3,  A  and  B,  and  fig.  S5).  Switch  junctions 
from  RiflF/FCdl9Cre/+  B  cells  were  comparable 
to  those  from  Trp53bpl  /  and  wild-type  con¬ 
trols  (fig.  S6)  (7),  which  indicates  that,  similar 
to  53BP1  deficiency,  absence  of  Rifl  does  not 
alter  the  nature  of  productive  CSR  joining  events. 


Fig.  1.  Identification  of  phospho-dependent  53BP1  interactors.  The  graph  shows  the  H/{H  +  L)  ratio 
distribution  of  proteins  identified  by  SILAC.  Error  bars  represent  the  SD  of  the  H/(H  +  L)  mean  value  for  all 
of  the  peptides  identified  for  each  individual  protein  (only  proteins  with  at  least  four  peptides  were 
included).  H/{H  +  L)  and  a  are  the  mean  (0.57)  and  SD  (0.09)  of  the  distribution,  respectively. 
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Fig.  2.  Rifl  interaction  with  53BP1  is  de¬ 
pendent  on  phosphorylation,  DNA  dam¬ 
age,  and  ATM.  (A)  Western  blot  analysis  of 
anti-Flag  immunoprecipitates  (IP)  from 
irradiated  (IR)  Trp53bplF~  B  lymphocytes 
infected  with  empty  vector  (vec),  53BP1db, 
or  53BP1DB28A  virus.  Triangles  indicate 
threefold  dilution.  Data  are  representa¬ 
tive  of  two  independent  experiments.  (B) 

Western  blot  analysis  of  anti-Flag  immu¬ 
noprecipitates  from  Trp53bpl~/~  B  cells 
infected  with  empty  vector  or  53BP1db. 

Cells  were  either  left  untreated  or  irradiated  [50  gray  (Gy),  45-min  recovery]  in  the  presence  or  absence  of 
the  ATM  kinase  inhibitor  KU55933  (ATMi).  Triangles  indicate  threefold  dilution.  Data  are  representative  of 
two  independent  experiments.  (C)  Immunofluorescent  staining  for  53BP1  (Flag)  and  Rifl  in  irradiated 
Trp53bpl^~  iMEFs  reconstituted  with  53BP1db  or  53BP1DB28A  retroviruses  (4).  Magnification,  lOOx;  scale 
bars,  5  jam.  Data  are  representative  of  two  independent  experiments.  DAPI,  4',6-diamidino-2-phenylindole. 


712 


8  FEBRUARY  2013  VOL  339  SCIENCE  www.sciencemag.org 


Downloaded  from  www.sciencemag.org  on  April  22,  2013 


REPORTS 


LPS  &  IL-4 


B  _ Cd19Cre/+ _ 

Rif1+/+  Rif1F/+  Rif1F/F 


& 


Cd19Cre/+ 


Cd19Cre/+ 


LPS 


80% 


0% 


□  WT 
|  Cd19Cre/+ 

|  Rif1F/FCd  1 9Cre/+ 


m  Chre  breaks  (Igh) 

E  B  Chre  breaks  (non -Igh) 

□  Chtid  breaks  (no n-lgh) 

□  Radial  chre 


Cd19Cre/+ 


F 

i  ¥  ■ v 

§ 

1 

O 

♦.  4 

"S3 

O 

‘▼ST 

1 

m  0 

Cd19c 


Rif1+/+ 


Rif1F/F 


—  Cd19Cre/+ 

-  Rif1F/FCd19Cre/+ 


62.4 


'0,92  22.5 


fo.77  22.9 


0  5  10  15  20 


0  5  10  15  20 


]0.99  17-2 


59  11-1 


'4.46  21.4 


121.8  29.9 


Cd19c 


Fig.  3.  Rifl  deficiency  impairs  CSR  and  causes  Igh  and  genome  instability  in 
primary  B  cells.  (A)  (Left)  CSR  to  IgGl  96  hours  after  stimulation  of  B  lymphocytes 
with  LPS  and  IL-4.  (Right)  Summary  dot  plot  for  three  independent  experiments 
(n  =  three  mice  per  genotype).  Mean  values  are:  23.6%  for  Cdl9Cre/+,  23.4%  for 
RiflF/+Cdl9Cre/+ ,  and  5.0%  for  RiflF/FCdl9Cre/+  ( P  <  0.008  with  the  paired 
Student's  t  test).  (Bottom)  B  cell  proliferation  by  carboxyfluorescein  succinimidyl 
ester  (CFSE)  dilution.  Data  are  representative  of  three  independent  experiments. 
(B)  Same  as  in  (A)  but  for  CSR  to  lgG3  after  stimulation  with  LPS  alone.  Mean 
values  are:  3.2%  for  Cdlfre/+,  3.4%  for  RiflF/+Cdltfre/+ ,  and  0.5%  for  RiflF/FCdlfre,+ 
(P  <  0.008).  (0  (Left)  Cell  cycle  analysis  of  primary  B  cells  after  stimulation  with 
LPS  and  IL-4.  BrdU,  5-bromo-2'-deoxyuridine;  7-AAD,  7-amino-actinomycin  D. 
(Right)  Summary  histograms  for  S,  Go/Gi,  and  G2/M  phase  cells  from  two 
independent  experiments  ( n  =  four  mice  per  genotype).  Error  bars  indicate  SEM. 


*  0.01  <  P  <  0.05,  **  0.001  <  P  <  0.01,  ***  P  <  0.001.  WT,  wild  type.  (D)  (Left)  Cell 
cycle  analysis  of  LPS-  and  IL-4-stimulated  splenocytes  at  the  indicated  times  after 
irradiation  (6  Gy).  (Right)  Summary  graphs  for  S,  Go/G;l,  and  G2/M  phase  cells  from 
two  independent  experiments  ( n  =  three  mice  per  genotype).  Error  bars  indicate  SD. 
(E)  Analysis  of  genomic  instability  in  metaphases  from  B  cell  cultures.  Chtid, 
chromatid;  Chre,  chromosome.  Data  are  representative  of  two  independent 
experiments  (n  =  50  metaphases  analyzed  per  genotype  per  experiment).  (F) 
Examples  of  /^-associated  aberrations  in  RiflF/FCdl9Cre/+  B  cells.  Chromo¬ 
somes  were  hybridized  with  an  Igh  Ca  probe  (green;  centromeric  of  Cyl)  and  a 
telomere  sequence-specific  probe  (red)  and  were  counterstained  with  DAPI  (dark 
blue/black).  Arrows  indicate  Igh  Ca/telomeric  signal  on  chromosome  12.  Mag¬ 
nification,  63x;  scale  bars,  1  pm.  (G)  Frequency  of  c-myc/lgh  translocations  in 
activated  B  cells.  The  graph  shows  combined  results  from  three  mice  per  genotype. 
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A  similar  CSR  defect  was  also  obtained  by 
conditionally  deleting  Rifl  with  4-hydroxy- 
tamoxifen  (4HT)  in  RiflF/FROSA26Cre~ERT2/+ 
B  cells  (fig.  S7).  Finally,  short  hairpin  RNA- 
mediated  partial  down-regulation  of  CtBP- 
interacting  protein  (CtfP),  which  interacts  with 
Rifl  (fig.  S8C)  and  has  been  implicated  in  pro¬ 
cessing  of  DNA  ends  (21,  22),  resulted  in  a  very 
small  but  reproducible  increase  in  CSR  (fig.  S8, 
A  and  B).  Thus,  Rifl  is  essential  for  normal  CSR, 
and  CtfP  may  not  be  the  only  factor  that  contrib¬ 
utes  to  end  processing  in  Rifl -deficient  B  cells. 


Class  switch  recombination  requires  cell  di¬ 
vision,  activation- induced  cytidine  deaminase 
(AID)  expression,  and  Igh  germline  transcription 
(23).  There  are  conflicting  reports  that  Rifl  is 
required  for  proliferation  in  MEFs,  but  not  in 
DT40  B  cells  (17,  18).  We  found  that  cell  divi¬ 
sion  profiles  of  RiflF/FCdl9Cre/+  and  4HT-treated 
RiflF/FROSA26Cre~ERT2/+  B  cells  were  indistin¬ 
guishable  from  controls  (Fig.  3,  A  and  B;  and  fig. 
S7,  A,  C,  E,  and  G),  indicating  that  Rifl  is  dis¬ 
pensable  for  B  cell  proliferation  in  vitro.  Finally, 
AID  mRNA  and  protein  expression  and  Igh  germ¬ 


line  transcription  were  not  affected  by  Rifl  de¬ 
letion  (fig.  S4,  B  and  D). 

We  next  examined  the  role  of  Rifl  in  cell 
cycle  progression  in  primary  B  cells.  We  found 
no  major  differences  in  the  percentage  of  cells 
in  G0/Gi  and  S  phases  (Fig.  3C).  However,  the 
number  of  cells  in  G2/M  phase  was  increased 
approximately  twofold  in  the  absence  of  Rifl 
(2.64-,  2.56-,  and  1.91-fold  at  48, 72,  and  96  hours, 
respectively)  (Fig.  3C).  We  obtained  similar 
results  with  the  use  of  RiflF/FROSA26Cre~ERT2/+ 
B  cells  treated  with  4HT  (fig.  S7,  H  and  I). 
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Fig.  4.  Rifl  prevents  resection  of  DNA  ends  at  sites  of  AID-induced  DNA 
damage.  (A  to  D)  RPA  and  Rad51  occupancy  at  the  Igh  locus  (A  and  0 
and  at  non -Igh  AID  targets  genes  (B  and  D)  in  B  cells  activated  to  un¬ 
dergo  class  switching.  ChIP-seq  libraries  were  resolved  into  upper  (+)  and 
lower  (-)  DNA  strands  to  show  RPA  and  Rad51  association  with  sense  and 
antisense  strands.  Within  a  specified  genomic  window,  graphs  have  the 
same  scale  and  show  tag  density.  Deep-sequencing  samples  were  nor¬ 
malized  per  library  size,  and  tags  per  million  values  were  calculated  for 


each  genic  region,  as  indicated  in  the  supplementary  materials  and  meth¬ 
ods  and  shown  in  parenthesis.  Data  are  representative  of  two  indepen¬ 
dent  experiments  for  RPA  ChIP-seq  and  one  for  Rad51.  (E)  Model  of  Rifl 
recruitment  and  DNA-end  protection  at  DSBs.  DNA  damage  activates 
ATM,  which  phosphorylates  many  targets,  including  53BP1.  This  event 
recruits  Rifl  to  53BP1  at  the  DSB,  where  it  inhibits  DNA  resection.  The 
extensive  resection  in  the  absence  of  Rifl  impairs  CSR  at  the  Igh  locus.  P, 
phosphate. 
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Furthermore,  irradiation  increases  the  accumulation 
of  Rif  lF/FCdl9Cre/+  B  cells  in  G2/M  phase  (Fig. 
3D).  In  addition,  Trp53bpl  7  iMEFs  expressing 
53BP1DB28A,  which  did  not  recmit  Rifl  to  IRIF 
(Fig.  2C),  exhibited  delayed  progression  through 
S  phase  following  DNA  damage  with  accumula¬ 
tion  of  cells  in  G2  phase  after  irradiation  (fig.  S9). 

Accumulation  of  cells  in  G2/M  phase  may 
reflect  the  persistence  of  unrepaired  DNA  dam¬ 
age  in  a  fraction  of  Rifl -deficient  cells.  To  investi¬ 
gate  this  possibility,  we  analyzed  metaphase  spreads 
from  B  cells  dividing  in  response  to  LPS  and 
IL-4  in  vitro.  These  cells  express  AID,  which 
produces  DSBs  in  Igh  and,  less  frequently  at  off- 
target  sites  throughout  the  genome,  in  the  Gi 
phase  of  the  cell  cycle  {24-26).  Chromosomal 
aberrations  were  increased  in  RiflF/FCdl9Cre/+ 
B  cells  compared  to  controls  (Fig.  3E),  with  many 
localized  to  the  Igh  locus  (Fig.  3E).  Consistent 
with  the  observation  that  Igh  is  targeted  by  ATP 
in  the  G  |  phase  of  the  cell  cycle,  all  of  the  Igh 
breaks  were  chromosome  breaks  (Fig.  3,  E  and  F). 
Interestingly,  the  frequency  of  c-myc/Igh  translo¬ 
cations  is  moderately  increased  in  RiflF/FCdl9Cre/+ 
B  cells;  however,  the  breakpoint  distribution  was 
similar  to  the  Cdl9Cre/+  control  (1.5  x  1CT6  ver¬ 
sus  1.0  x  10  6  in  the  control;  P=  0.039)  (Fig.  3G 
and  fig.  S10).  We  conclude  that  in  the  absence  of 
Rifl,  DSBs  fail  to  be  resolved  efficiently  in  the 
Gi,  S,  or  G2  phases,  which  leads  to  increased 
levels  of  genomic  instability,  including  chromo¬ 
some  breaks  at  Igh  and  translocations  in  dividing 
B  cells. 

In  the  absence  of  53BP1,  DSBs  produced 
by  AID  at  the  Igh  locus  accumulate  the  single- 
stranded  DNA-binding  replication  protein  A  com¬ 
plex  (RPA)  as  a  result  of  increased  DNA-end 
resection  {24).  To  determine  if  Rifl  is  required 
for  DNA-end  protection  by  53BP1 ,  we  performed 
RPA-chromatin  immunoprecipitation  followed 
by  massive  parallel  sequencing  (ChIP-seq)  ex¬ 
periments  on  RiflF/FCdl9Cre/+  and  control  B  cells. 
Ablation  of  Rifl  was  indistinguishable  from 
the  loss  of  53BP1  in  that  in  its  absence,  RPA  dec¬ 
orates  the  Igh  locus  asymmetrically,  in  a  manner 
consistent  with  5 '-3'  resection  (Fig.  4A)  {27).  In 
addition,  absence  of  Rifl  also  results  in  RPA 
accumulation  at  non  -Igh  genes,  such  as  114  ra  and 
Piml ,  that  are  damaged  by  AID  in  Gi  phase  (Fig. 
4B)  {24,  25).  Rad51  is  the  recombinase  that 
mediates  repair  of  DSBs  by  homologous  recom¬ 
bination  in  S/G2/M  phase  {22).  To  confirm  that 
Rifl  prevents  resection  that  takes  place  in  S 
phase,  we  monitored  Rad5 1  accumulation  in  ac¬ 
tivated  B  cells  by  ChIP-seq.  Loss  of  Rifl  was 


indistinguishable  from  the  loss  of  53BP1  (27),  in 
that  it  led  to  asymmetric  Rad5 1  accumulation  at 
sites  of  AID-inflicted  DNA  damage  (Fig.  4,  C 
and  D).  We  conclude  that  in  the  absence  of  Rifl, 
AID-induced  DSBs  incurred  in  Gi  phase  persist 
and  undergo  extensive  5 '-3'  DNA-end  resection 
in  S/G2/M  phase,  as  measured  by  RPA  and  Rad5 1 
accumulation. 

A  role  for  Rifl  in  maintenance  of  genome 
stability  and  protection  of  DNA  ends  against  re¬ 
section  is  consistent  with  its  phosphorylation- 
dependent  recruitment  to  the  N-terminal  domain 
of  53BP1  {4).  53BP1  facilitates  DNA  repair  and 
prevents  DNA-end  resection  during  CSR.  In  the 
absence  of  53BP1,  AID-induced  DSBs  are  re¬ 
solved  inefficiently  in  Gi  phase,  leading  to  chro¬ 
mosome  breaks,  Igh  instability,  and  resolution  by 
alternative  NHEJ  or  homologous  recombination 
instead  of  classical  NHEJ  (■ 4 ,  8 ,  27).  Our  exper¬ 
iments  show  that  in  the  absence  of  Rifl,  53BP1  is 
insufficient  to  promote  genomic  stability  or  me¬ 
diate  efficient  Igh  repair,  DNA-end  protection,  or 
CSR.  Thus,  these  53BP1  activities  require  Rifl 
recruitment  to  the  phosphorylated  N  terminus  of 
53BP1 .  Rifl  is  likely  to  have  additional  functions 
beyond  53BP1,  CSR,  and  DNA-end  protection 
because  although  Trp53bpl~ 7  mice  are  viable, 
Rifl  deletion  is  lethal  {17).  Indeed,  Rifl  is  be¬ 
lieved  to  play  a  role  in  the  repair  of  S-phase  DNA 
damage  (77,  18),  as  well  as  in  the  regulation  of 
replication  timing  (79,  20,  28).  Analogously,  ad¬ 
ditional  CSR  factor(s)  may  exist  downstream  of 
53BP1,  as  class  switching  in  Rifl-deficienct  B 
cells  is  significantly  higher  than  in  Trp53bpl~/~ . 

In  summary,  our  data  are  consistent  with  a 
model  in  which  ATM-mediated  phosphorylation 
of  53BP1  recmits  Rifl  to  sites  of  DNA  damage, 
where  it  facilitates  DNA  repair  in  part  by  pro¬ 
tecting  DNA  ends  from  resection  (Fig.  4E).  In  the 
absence  of  Rifl,  DNA  breaks  incurred  in  Gi 
phase  fail  to  be  repaired  by  NHEJ  and  undergo 
extensive  5 '-3'  end  resection,  resulting  in  the  ac¬ 
cumulation  of  chromosome  breaks  and  genome 
instability. 
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SUMMARY 

DNA  double-strand  breaks  (DSBs)  in  B  lymphocytes 
arise  stochastically  during  replication  or  as  a  result  of 
targeted  DNA  damage  by  activation-induced  cyti- 
dine  deaminase  (AID).  Here  we  identify  recurrent, 
early  replicating,  and  AID-independent  DNA  lesions, 
termed  early  replication  fragile  sites  (ERFSs),  by 
genome-wide  localization  of  DNA  repair  proteins 
in  B  cells  subjected  to  replication  stress.  ERFSs 
colocalize  with  highly  expressed  gene  clusters  and 
are  enriched  for  repetitive  elements  and  CpG 
dinucleotides.  Although  distinct  from  late-replicating 
common  fragile  sites  (CFS),  the  stability  of  ERFSs 
and  CFSs  is  similarly  dependent  on  the  replication- 
stress  response  kinase  ATR.  ERFSs  break  spon¬ 
taneously  during  replication,  but  their  fragility  is 
increased  by  hydroxyurea,  ATR  inhibition,  or  deregu¬ 
lated  c-Myc  expression.  Moreover,  greater  than 
50%  of  recurrent  amplifications/deletions  in  human 
diffuse  large  B  cell  lymphoma  map  to  ERFSs.  In 
summary,  we  have  identified  a  source  of  sponta¬ 
neous  DNA  lesions  that  drives  instability  at  preferred 
genomic  sites. 


INTRODUCTION 

Double-strand  breaks  (DSBs)  arise  spontaneously  during  DNA 
replication,  as  a  result  of  oncogenic  stress,  and  as  a  part  of  the 
gene  diversification  programs  in  lymphocytes  (Bartek  et  al., 


2007;  Callen  et  al.,  2007;  Gostissa  et  al.,  2011;  Halazonetis 
et  al.,  2008).  When  B  lymphocytes  are  activated,  they  undergo 
rapid  proliferation  and  simultaneously  initiate  two-genome  re¬ 
modeling  reactions,  termed  somatic  hypermutation  (SHM)  and 
class  switch  recombination  (CSR).  The  coupling  of  rapid  cycling 
and  programmed  DNA  damage  poses  the  B  cell  genome  at  high 
risk  for  destabilization. 

SHM  introduces  point  mutations  in  the  variable  region  of 
immunoglobulin  (Ig)  genes,  which  can  increase  antibody  affinity, 
whereas  CSR  is  a  DNA  deletion  event  that  replaces  one  Ig 
constant  region  gene  for  another.  Both  of  these  reactions  are 
initiated  by  the  enzyme  activation-induced  cytidine  deaminase 
(AID),  which  deaminates  cytosine  residues  in  single-stranded 
DNA  exposed  during  Ig  gene  transcription  (Chaudhuri  and  Alt, 
2004).  In  addition  to  Ig  genes,  AID  causes  a  considerable  amount 
of  collateral  genomic  damage  (Chiarle  et  al.,  2011;  Kato  et  al., 
2012;  Klein  et  al.,  2011;  Liu  et  al.,  2008),  including  oncogenic 
targets  such  as  c-Myc  (Robbiani  et  al.,  2008).  Nevertheless, 
many  recurrent  mutations  in  B  cell  lymphoma  are  not  associated 
with  AID  activity,  and  the  mechanisms  of  rearrangements  at 
these  sites  remain  unclear. 

The  DNA  damage  response  (DDR)  is  activated  during  pro¬ 
grammed  rearrangements  in  lymphocytes  to  ensure  faithful 
DNA  repair  and  prevent  chromosomal  translocation  (Chen 
et  al.,  2000;  Petersen  et  al.,  2001).  The  DDR  is  also  triggered 
by  aberrant  oncogene  expression  that  induces  precocious  entry 
into  S  phase  and  perturbs  replication  fork  progression  (Bartek 
et  al.,  2007;  Bester  et  al.,  201 1 ;  Halazonetis  et  al.,  2008).  Replica¬ 
tion  fork  instability  can  also  be  triggered  by  exogenous  agents 
such  as  hydroxyurea  (HU),  which  depletes  deoxynucleotide 
pools,  or  by  deficiencies  in  homologous  recombination  path¬ 
ways  that  are  needed  to  complete  DNA  replication  after  fork 
stalling  or  collapse  (Schlacher  et  al.,  2012). 
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Oncogenic  stress  has  been  shown  to  preferentially  target 
genomic  regions  called  common  fragile  sites  (CFSs)  (Bartek 
et  al.,  2007;  Halazonetis  et  al.,  2008).  Historically,  CFSs  have 
been  mapped  in  lymphocytes  but  are  induced  in  all  cell  types 
under  conditions  that  obstruct  replication,  such  as  treatment 
with  low  doses  of  the  DNA  polymerase  inhibitor  aphidicolin. 
DNA  breakage  within  CFSs  spans  megabase  regions.  Neverthe¬ 
less,  CFSs  share  characteristic  features  including  association 
with  very  large  genes,  enrichment  of  long  stretches  of  AT  dinu- 
cleotide-rich  repeats,  and  incomplete  DNA  replication  (Durkin 
and  Glover,  2007). 

Replication-stress-induced  DNA  damage  is  also  observed  in 
yeast.  Similar  to  CFSs,  sites  located  in  “replication  slow  zones” 
(RSZs)  are  late  replicating  and  breakage  prone  (Cha  and  Kleck- 
ner,  2002).  In  addition  to  late  replicating  areas,  irreversible  repli¬ 
cation  fork  collapse  in  response  to  acute  doses  of  hydroxyurea 
has  been  observed  preferentially  around  a  subset  of  early  firing 
replication  origins  in  yeast  (Raveendranathan  et  al.,  2006),  which 
do  not  overlap  with  RSZs  (Cha  and  Kleckner,  2002;  Hashash 
et  al.,  2011).  Although  the  molecular  mechanisms  governing 
replication  initiation  in  yeast  and  mammalian  cells  are  distinct, 
we  wondered  if  fragility  at  early  firing  origins  is  also  a  feature  of 
mammalian  cells.  Here,  we  identify  highly  unstable  regions  of 
the  B  cell  genome  designated  as  “early  replicating  fragile  sites” 
(ERFSs).  We  propose  that  ERFSs  are  a  new  class  of  fragile  sites 
in  mammalian  cells  that  contribute  to  recurrent  rearrangements 
during  lymphomagenesis. 

RESULTS 

Genome-wide  Mapping  of  Replication-Induced  DNA 
Damage 

Single-strand  DNA  (ssDNA)  mapping  has  been  used  to  localize 
origins  of  replication  in  yeast  (Feng  et  al.,  2006).  To  identify 
potential  sites  of  fork  collapse,  we  first  profiled  the  location 
and  extent  of  ssDNA  genome-wide  using  chromatin  immunopre- 
cipitation  (ChIP)  with  an  anti-replication  protein  A  (RPA)  antibody 
(Figure  1).  RPA  associates  with  ssDNA  at  stalled  forks  near  early 
firing  origins  when  fork  movement  is  inhibited  by  HU  (Tanaka  and 
Nasmyth,  1998). 

Freshly  isolated  mouse  B  cells  are  arrested  in  the  G0  phase  of 
the  cell  cycle  (Figure  1A).  Upon  stimulation  with  LPS/IL4,  cells 
synchronously  enter  into  the  cell  cycle  so  that  by  22  hr,  approx¬ 
imately  8%  of  cells  have  entered  S  phase,  whereas  at  28  hr  over 
30%  are  in  S/G2  phases  (Figure  1A).  To  profile  early  replication 
origins,  we  treated  cells  at  22  hr  with  10  mM  HU  for  6  hr  to  fully 
arrest  cells  at  G-|/S  (Figures  1A  and  IB).  We  then  performed 
ChIP-seq  of  RPA  in  both  untreated  and  HU-treated  cells  at 
28  hr  (Figures  1A  and  IB).  Two  independent  experiments 
showed  reproducibility  of  genome-wide  RPA  association  in 
HU-treated  cells  (Figure  SI  A  available  online).  We  generated 
profiles  of  RPA  in  untreated  and  treated  samples,  centered  on 
individual  RPA-bound  sites  (Figure  SI  B),  and  observed  a  marked 
increase  in  the  intensity  of  RPA  in  HU-treated  B  cells  relative  to 
untreated  cells  where  5,939  out  of  11,942  genomic  regions 
(49.7%)  displayed  more  than  a  4-fold  increase  in  RPA  recruit¬ 
ment.  In  addition  to  the  53%  overlap  of  RPA-associated  regions 
between  HU-untreated  versus  -treated  cells,  we  also  observed 


that  1 ,441  regions  were  present  only  in  HU-treated  samples  (Fig¬ 
ure  SI  B).  These  HU-dependent  ssDNA  regions  may  correspond 
to  the  firing  of  new  replication  origins  to  compensate  for  ineffi¬ 
cient  replication. 

To  confirm  that  RPA  recruitment  maps  early  replication  zones, 
we  used  the  Repli-Seq  approach  (Hansen  et  al.,  201 0)  to  identify 
replication  origins  in  B  cells  during  HU  arrest.  Approximately 
12,000  early  activating  replication  origins  across  the  murine  B 
cell  genome  were  identified  (Figure  SI  C).  By  comparing  the  distri¬ 
bution  of  BrdU  incorporation  relative  to  the  individual  RPA-occu- 
pied  genomic  regions,  we  observed  association  of  BrdU  incorpo¬ 
ration  with  nearly  80%  of  RPA-bound  regions  (Figure  SIC). 
Moreover,  more  than  86%  of  RPA/BrdU  enriched  genomic  sites 
coincided  with  previously  mapped  early  replicating  regions  in 
the  mouse  B  cell  line  CHI 2  (Stamatoyannopoulos  et  al.,  2012) 
(p(permutation)  <  1  x  10-5,  Figure  SID).  Thus,  HU-arrested  B 
cells  exhibited  an  enrichment  of  RPA  at  early  replicating  zones, 
consistent  with  an  early  S  phase  cell-cycle  arrest  (Figure  1  A). 

Early  replicating  regions  are  associated  with  accessible  chro¬ 
matin  configuration  (MacAlpine  et  al.,  2004).  In  agreement  with 
this,  we  found  that  more  than  67%  of  RPA-bound  regions  in 
HU-arrested  cells  reside  within  intragenic  sequences  (Figures 
S1E  and  SI  I),  a  frequency  significantly  higher  than  expected 
(p(permutation)  <  1  x  10-5).  Moreover,  RPA  preferentially  asso¬ 
ciated  with  DNasel  hypersensitive  sites  (DHS)  and  euchromatic 
promoters  marked  by  H3K4me3  (Figure  S1F).  Finally,  we 
measured  transcriptional  activity  in  HU-treated  B  cells  directly 
by  genome-wide  RNA  sequencing.  We  observed  high  transcrip¬ 
tion  activity  within  the  RPA-occupied  genomic  regions  as  shown 
by  the  aggregated  pattern  of  RNA-Seq  centered  on  those 
regions  (Figure  S1G).  Moreover,  6,100  RPA-bound  RefSeq 
genes  exhibited  significantly  higher  average  mRNA  abundance 
than  those  that  did  not  show  RPA  binding  (p  <  1  x  10-16,  Fig¬ 
ure  S1H).  Thus,  HU-induced  RPA  recruitment  in  early  S  phase 
maps  to  actively  transcribed  genes  that  show  the  hallmarks  of 
euchromatin. 

Replisome  stalling  in  response  to  HU  triggers  the  activation  of 
the  ATR  kinase  (Ward  and  Chen,  2001),  which  protects  forks 
from  collapse  (Cimprich  and  Cortez,  2008),  and  leads  to  phos¬ 
phorylation  of  H2AX  (y-H2AX)  (Ward  and  Chen,  2001),  which 
colocalizes  with  RPA  (Petermann  et  al.,  2010).  To  examine  the 
relative  distribution  of  y-H2AX  and  RPA  genome-wide,  we 
carried  out  ChIP-seq  with  an  antibody  that  recognizes  y-H2AX 
(Figure  SI  A)  and  examined  their  profiles  with  respect  to  the 
center  of  RPA-bound  sites.  y-H2AX-associated  genomic 
regions  were  much  broader  than  RPA,  but  these  regions  overlap¬ 
ped  with  93%  of  RPA-bound  sites  marking  ssDNA  in  HU-treated 
cells  (Figure  1C),  consistent  with  the  finding  that  y-H2AX  marks 
stalled  forks  even  prior  to  DSB  formation  (Petermann  et  al., 
2010).  y-H2AX/RPA  enriched  loci  may  therefore  correspond  to 
a  combination  of  stalled  and  broken  replisomes. 

Cells  deficient  in  homologous  recombination  (HR)  pathway 
components,  such  as  XRCC2,  often  accumulate  spontaneous 
chromosome  breaks  and  exhibit  hypersensitivity  to  HU  (Sonoda 
et  al.,  1998).  Consistent  with  increased  spontaneous  DNA 
damage  at  replication  forks,  untreated  XRCC2~/~  cells  exhibited 
accumulation  of  y-H2AX  at  similar  genomic  regions  and  at 
almost  similar  levels  observed  in  HU-treated  wild-type  (WT) 
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Figure  1.  Mapping  Replication-Induced  DNA  Damage  in  Murine  B  Lymphocytes 

(A)  FACS  analysis  showing  DNA  content  of  freshly  isolated  and  ex  vivo  stimulated  splenic  murine  B  lymphocytes  in  the  absence  and  presence  of  10  mM  HU. 

(B)  Experimental  plan  describing  cell  synchronization  and  isolation  for  samples  used  in  ChIP-seq  and  RNA-Seq  experiments. 

(legend  continued  on  next  page) 
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B  cells  (Figures  S2A-S2C).  90%  of  y-H2AX-associated  genomic 
regions  in  untreated  XRCC2~'~  cells  correlate  with  the  regions 
enriched  for  this  protein  in  HU-treated  WT  B  cells  (Figure  S2B), 
and  nearly  80%  of  the  regions  with  enriched  y-H2AX  observed 
in  HU-treated  WT  B  cells  overlapped  with  those  seen  in  HU- 
treated  XRCC2~/~  cells  (Figure  S2C).  These  data  indicate  that 
XRCC2  deficiency  leads  to  increased  endogenous  levels  of 
replication  stress  mostly  at  the  same  loci  where  HU  induces 
replication  fork  stalling  and/or  breakage  in  WT  cells. 

RPA,  BRCA1,  and  SMC5  Colocalization  Marks  the  Sites 
of  Replication  Stress  in  Early  Replicating  Zones 

Like  XRCC2,  BRCA1  and  members  of  the  structural  mainte¬ 
nance  of  chromosome  (SMC)  family  have  been  implicated 
in  promoting  replication  fork  restart  (Schlacher  et  al.,  2012; 
Stephan  et  al.,  2011).  To  determine  whether  HR  proteins  bind 
to  a  subset  of  stalled  forks  marked  by  RPA  and  y-H2AX,  we 
also  defined  the  genome-wide  profile  of  BRCA1  and  SMC5. 
We  confirmed  BRCA1  and  SMC5  ChIP-seq  efficacy  by 
observing  their  association  at  both  Sfi  and  Syl  in  53BP7_/_ 
cells,  where  the  breaks  in  IgH  persist  unrepaired  and  undergo 
extensive  resection  (Figure  S3A)  (Bothmer  et  al.,  2010;  Bunting 
et  al.,  201 0,  201 2;  Yamane  et  al.,  201 1 , 201 3). 

We  then  determined  the  localization  of  BRCA1  and  SMC5  in 
HU-arrested  B  cells.  Two  independent  experiments  showed 
reproducibility  of  genome-wide  BRCA1  and  SMC5  association 
(Figures  S3B  and  S3C).  To  identify  the  RPA  genomic  sites  co¬ 
occupied  by  the  HR  proteins  BRCA1  and  SMC5,  we  plotted 
the  distribution  of  their  binding  with  respect  to  the  center  of 
individual  RPA-bound  regions.  Overall,  2,204  regions  spanning 
1 0  kbp  on  average  showed  RPA/BRCA1/SMC5  triple  colocaliza¬ 
tion  (Figures  ID  and  IE).  We  found  that  RPA  was  recruited  to 
more  than  88%  of  genomic  sites  exhibiting  BRCA1  and  SMC5 
association  (Figure  1 E).  Furthermore,  genome-wide  analysis  of 
RPA/BRCA1/SMC5  profiles  in  untreated  cells  revealed  more 
than  a  21  %  increase  in  the  number  of  genomic  regions  occupied 
by  these  three  proteins  after  HU  treatment  (Figure  S4A).  Never¬ 
theless,  48%  of  RPA/BRCA1/SMC5  triple  colocalizations  were 


common  between  the  unperturbed  and  HU-arrested  B  cells 
(Figure  S4A).  Therefore,  we  hypothesized  that  chromatin 
with  concomitant  RPA,  BRCA1 ,  and  SMC5  binding  might  corre¬ 
spond  to  regions  undergoing  replication  fork  collapse  both  in 
response  to  replication  stress  and  during  normal  DNA  replica¬ 
tion.  Given  that  our  analysis  focused  on  early  replicating  sites, 
which  contrasts  with  late  replicating  CFSs,  we  designated  these 
regions  as  ERFSs. 

We  then  characterized  ERFSs  to  determine  whether  they 
share  common  underlying  primary  sequence  characteristics. 
Indeed,  these  loci  were  enriched  at  known  repetitive  elements, 
including  LINE  L2,  SINE,  DNA  transposons,  and  tRNA  elements 
(p(permutation)  <  1  x  1CT3,  Figure  1 F),  which  are  known  replica¬ 
tion  fork  barriers  (Mirkin  and  Mirkin,  2007).  Furthermore,  ERFSs 
showed  significantly  higher  G  and  C  nucleotide  content 
compared  to  the  whole  mouse  genome,  in  contrast  to  CFSs 
that  are  enriched  in  A+T  sequences  (p(Wilcoxon)  <  1  x  10-16, 
Figure  S4B).  Twenty-six  percent  of  the  ERFSs  regions  overlap¬ 
ped  with  CpG  islands,  which  are  highly  enriched  at  translocation 
breakpoints  in  B  cell  lymphoma  (Tsai  et  al.,  2008).  Conversely, 
CpG  islands  covered  approximately  400,000  nucleotides 
within  these  regions  (p(permutation)  <  1  x  10-5,  Figure  1G).  As 
anticipated,  ERFSs  clustered  at  early  replication  origins  (Fig¬ 
ure  S4C),  and  over  66%  of  the  loci  overlapped  with  intragenic 
or  promoter  sequences  of  RefSeq  annotated  protein  coding 
genes  (p(permutation)  <  1  x  1 0-3,  Figures  S4D  and  S4E). 
Moreover,  ERFSs  are  more  transcriptionally  active  relative  to 
flanking  genomic  regions  shown  by  relative  mRNA  enrichment 
by  RNA-Seq  (Figure  1 H).  Indeed,  more  than  86%  of  the  RefSeq 
annotated  genes  with  ERFSs  are  among  the  highest  transcribed 
genes  (p(binomial)  <1x1 0-16,  Figure  S4F).  Finally,  ERFSs  were 
significantly  enriched  in  gene  pairs  that  are  transcribed  in 
converging  or  diverging  directions  (see  Experimental  Proce¬ 
dures),  such  as  the  convergent  transcription  pair  of  IKZF1  and 
FIGNL1  shown  in  Figure  2A.  Compared  to  expected  values, 
ERFSs  were  at  least  two  times  more  likely  to  localize  in  regions 
containing  gene  pairs  exhibiting  convergent  and/or  divergent 
gene  pairs  (p(permutation)  <  1  x  1CT5,  Figure  II). 


(C)  For  each  RPA-bound  site  in  response  to  1 0  mM  HU  (y  axis),  each  column  depicts  the  presence  of  RPA  (left)  and  y-H2AX  (right)  within  a  window  centered  on  the 
RPA-bound  sites.  Color  map  corresponds  to  binding  intensities  where  “black”  represents  no  binding.  K-mean  clustering  algorithm  was  used  to  group  the  protein- 
bound  sites. 

(D)  RPA,  SMC5,  and  BRCA1  co-occupy  2,204  genomic  regions  in  response  to  10  mM  HU.  The  plot  in  each  column,  from  left  to  right,  represents  the  pattern  of 
RPA,  SMC5,  and  BRCA1  genomic  occupancy  in  response  to  HU  centered  on  RPA-bound  sites.  K-mean  clustering  algorithm  is  used  to  group  the  protein-bound 
sites. 

(E)  The  Venn  diagram  shows  the  overlap  of  sites  bound  by  RPA,  SMC5,  and  BRCA1  in  response  to  10  mM  HU.  The  total  number  of  bound  sites  is  indicated  for 
each  shared  and  unique  area. 

(F)  Relative  frequency  of  ERFSs  in  classes  of  repetitive  sequences  is  shown.  Dashed  line  indicates  the  expected  frequency  based  on  the  permutation  model 
(*,  enriched  repetitive  element  classes;  p  <  1  x  10-3). 

(G)  ERFSs  are  enriched  in  CpG  islands.  Total  CpG  island  sequences  in  all  the  2,204  ERFSs  as  indicated  by  the  crossed  red  point  is  compared  to  the  permutation 
model  as  indicated  by  the  gray  points.  Each  gray  point  corresponds  to  the  total  CpG  island  sequences  covered  in  an  iteration  of  the  permutation  model.  The  box 
plot  depicts  the  quantiles  of  total  CpG  sequences  based  on  the  permutation  model  (p  <  1  x  10-5). 

(H)  ERFS  genomic  regions  are  transcriptionally  active.  The  line  plot  represents  the  average  RNA  tag  count  (loess  smoothed)  in  a  genomic  window  around  the 
center  of  the  ERFSs. 

(I)  ERFSs  are  enriched  in  transcriptionally  active  convergent  and  divergent  gene  pairs.  Count  of  divergent/convergent  gene  pairs  coinciding  with  ERFSs  as 
indicated  by  the  crossed  red  point  is  compared  to  the  permutation  model  as  indicated  by  the  gray  points.  Each  gray  point  corresponds  to  the  total  number  of 
divergent/convergent  gene  pairs  observed  in  an  iteration  of  the  permutation  model.  The  box  plot  depicts  the  quantiles  of  the  total  convergent/divergent  transcript 
pair  count  based  on  the  permutation  model  (p  <  1  x  1CT5).  For  definition  of  convergent/divergent  gene  pairs  see  Experimental  Procedures.  See  also  Figures  SI , 
S3,  S4. 
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Figure  2.  ERFS  “Hot  Spots”  Associate  with 
Highly  Transcribed  Gene  Clusters 

(A)  Gene  tracks  represent,  from  the  top,  ERFS  and 
ERFS  hot  spot  demarcations;  bindings  of  RPA, 
BRCA1,  SMC5,  yH2AX  occupancy;  and  BrdU 
incorporation  near  the  IKZF1  locus.  The  y  axis 
represents  the  total  number  of  mapped  reads  per 
million  of  mapped  reads  (RPM)  in  200  nucleotide 
windows  (sliding-window  smoothed). 

(B)  Genome-wide  map  of  619  ERFS  hot  spots. 
Each  hot  spot  is  represented  by  a  green  dot  on  the 
ideograms.  The  top  fifteen  hot  spots  are  color- 
coded  in  red. 

(C)  Table  of  the  top  15  ERFS  hot  spots.  ERFS  hot 
spots  are  ordered  based  on  a  ranked  statistics  of 
RPA/SMC5/BRCA1 -binding  strength  (see  Experi¬ 
mental  Procedures).  The  first  column  depicts 
a  representative  gene  within  the  hot  spot.  A  hot 
spot  containing  at  least  three  genes  is  designated 
as  a  “gene-cluster.”  A  hot  spot  with  a  gene  tran¬ 
script  value  greater  than  1  RPKM  (reads  per  kilo- 
base  exon  model  per  million  mapped  reads)  is 
designated  as  transcribed.  ERFS  rearrangements 
in  B  cell  cancers  are  listed  in  Table  S2.  ERFS  is 
designated  as  “AID-target”  according  to  (Chiarle 
et  al.,  201 1 ;  Klein  et  al.,  2011).  For  complete  defi¬ 
nition  of  columns  see  Experimental  Procedures. 
See  also  Tables  SI  and  S2. 


have  a  lower  gene  density  (Figure  2B). 
An  examination  of  the  top  15  hot  spots 
based  on  a  ranked  statistics  of  RPA/ 
BRCA1/SMC5-binding  strength  showed 
that  9  out  of  the  15  regions  contained 
gene  clusters  with  at  least  three  genes, 
and  12  out  of  15  exhibited  divergent/ 
convergent  gene  pairs  (Figures  2A  and 
2C;  Table  SI).  Of  note,  8  out  of  15  hot 
spots  are  also  rearranged  in  B  cell 
lymphomas  (Figure  2C;  Table  S2),  sug¬ 
gesting  a  possible  link  among  ERFSs, 
genome  rearrangements,  and  cancer 
(see  below). 

Early  S  Phase  Arrest  by  HU  Induces 
DNA  Damage  at  ERFSs,  but  Not 
at  CFSs 


Replication  is  organized  into  discrete  zones  (30-450  kbp  in 
size)  containing  multiple  replication  origins  that  exhibit  similar 
replication  timing  (Costa  and  Blow,  2007).  Similarly,  approxi¬ 
mately  80%  of  the  ERFSs  are  within  300  kbp  of  one  another 
(Figures  2A  and  S4G).  We  therefore  integrated  these  neighboring 
clustered  ERFSs  and  removed  those  with  footprints  less  than 
10  kbp  to  define  619  triple  colocalized  hot  spot  regions  (Fig¬ 
ure  2B;  Table  SI).  Interestingly,  whereas  these  hot  spots  were 
distributed  throughout  the  genome,  the  density  of  hot  spots  on 
autosomes  was  higher  than  on  the  sex  chromosomes,  which 


DNA  damage  at  CFSs  is  visualized  by 
conventional  cytogenetic  analysis  of 
metaphase  chromosomes  (Durkin  and  Glover,  2007).  To  investi¬ 
gate  whether  the  ERFSs  defined  by  RPA/  BRCA1/SMC5  binding 
are  prone  to  actual  breakage,  we  again  treated  cells  with  1 0  mM 
HU,  released  them  into  fresh  medium  overnight,  and  examined 
metaphase  spreads.  Chromatid  breaks,  chromosome  breaks, 
and  rearrangements  could  be  discerned  in  20%-60%  of  WT 
cells  after  HU  treatment  (Figure  S2D).  To  determine  whether 
ERFSs  are  more  sensitive  to  breakage  under  replication  stress 
than  regions  lacking  RPA/BRCA1/SMC5  binding  (i.e.,  cold 
spots),  we  hybridized  metaphases  with  bacterial  artificial 
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chromosome  (BAC)  probes  corresponding  to  six  ERFS  hot  spots 
(MHCII,  GIMAP,  SWAP70,  BACH2,  IKZF1 ,  and  FOXP1)  (Figures 
3A  and  3B),  two  cold  spots  ( CNTNAP4  and  SLITRK6)  and  two 
CFSs  (FRA8E1  and  FRA14A2).  For  each  of  the  six  ERFS  hot 
spots,  a  total  of  at  least  40  chromosome  aberrations  were 
counted  (Table  S3).  Notably,  all  six  ERFS  hot  spots  displayed 
chromosome  aberrations  in  metaphases  from  HU-treated 
samples  (Figure  3B).  In  contrast,  neither  of  the  cold  regions  or 
CFSs  was  broken  under  the  same  conditions  (Figure  3B).  Over¬ 
all,  8%-1 5%  of  the  total  damage  localized  to  individual  ERFS  hot 


Figure  3.  ERFS  Break  in  Response  to  HU 

(A)  Upper:  diagram  of  FISH  probes.  Lower: 
representative  DNA aberrations  identified  by  FISH. 
Blue  is  DAPI-stained  DNA,  green  represents  the 
BAC  probe  {MHCII,  GIMAP,  SWAP70,  BACH2, 
IKZF1,  or  FOXP1)  and  red  marks  telomeric  DNA. 

(B)  HU-induced  aberrations  were  found  at  ERFSs 
but  not  at  “cold  sites”  ( CNTNAP4 ,  SLITRK6)  or 
CFSs  (FRA8E1 ,  FRA14A2).  Quantitation  of  abnor¬ 
malities  from  FISH  analysis  of  untreated  cells  (blue 
bars)  or  cells  treated  with  10  mM  HU  (red  bars). 
The  percent  aberrations  specifically  at  the  BAC 
probes  relative  to  the  total  damage  is  plotted. 

(C)  Abnormalities  detected  by  FISH  in  untreated 
(blue  bars)  and  10  mM  HU-treated  (red  bars) 
XRCC2-'-  cells. 

(D)  Upper:  diagram  of  FISH  probes.  Lower: 
representative  metaphase  showing  a  sponta¬ 
neous  break  at  the  GIMAP  locus  in  an  XRCC2~'~ 
cell. 

(E)  Quantitation  of  abnormalities  detected  by  FISH 
in  untreated  (blue  bars)  and  0.2  ^iM  aphidicolin- 
treated  (red  bars)  WT  cells. 

(F)  Upper:  diagram  of  FISH  probes.  Lower: 
representative  metaphases  showing  aphidicolin- 
induced  breaks  at  the  FRA14A2  and  FRA8E1  loci 
in  WT  cells.  See  also  Figure  S2  and  Table  S3. 


spots,  representing  a  significant  fraction 
of  the  total  damage  (Figure  3B).  DNA 
lesions  were  observed  on  either  the 
centromeric  or  telomeric  sides  of  ERFS- 
specific  hybridized  BAC  (Figure  S2E), 
suggesting  that  an  ERFS  represents 
a  large  fragile  genomic  region. 

Aberrations  at  ERFS  hot  spots  were 
also  detected  in  XRCC2~'~  cells  treated 
with  HU  (Figure  3C).  XRCC2~'~  cells  are 
more  sensitive  to  HU  than  WT  cells  are, 
as  evidenced  by  the  higher  level  of  total 
damage  in  these  cells  (Figure  S2D). 
Breaks  at  MHCII,  GIMAP,  SWAP70, 
BACH2,  IKZF1,  and  FOXP1  were  found 
in  5%-10%  of  HU-treated  XRCC2 
cells  compared  with  1  %-6%  of  WT  cells 
damaged  in  these  regions  (Table  S3). 

4A2  fra8ei  Nevertheless,  the  frequency  of  ERFS- 

specific  instability  relative  to  the  total 
damage  was  similar  in  XRCC2~/~  and 
WT  cells  (Figures  3B  and  3C).  Interestingly,  breaks  in  the  vicinity 
of  the  GIMAP  hot  spot  were  detectable  spontaneously  in 
XRCC2~/~  cells  (Figures  3C  and  3D;  Table  S3),  which  is  consis¬ 
tent  with  increased  y-H2AX  observed  in  unchallenged  XRCC2 
mutant  cells  (Figure  S2A). 

None  of  the  eight  CFSs  defined  in  mouse  (Helmrich  et  al., 
2006)  were  among  our  619  ERFS  hot  spots  (Table  SI).  Consis¬ 
tent  with  this,  DNA  aberrations  at  two  of  the  most  expressed 
CFSs  in  mouse  lymphocytes,  FRA14A2  and  FRA8E1  (Helmrich 
et  al.,  2006)  were  undetectable  in  HU-treated  WT  samples 
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Figure  4.  ERFS  Break  in  Response  to  ATR 
Inhibition  and  High  Transcription 

(A)  Quantitation  of  aberrations  observed  by  FISH 
in  response  to  overnight  exposure  to  1  fiM  ATRi  in 
WT  (blue  bars)  and  XRCC2~/~  cells  (red  bars). 

(B)  Gene  tracks  represent,  from  the  top,  ERFS 
demarcation  and  transcription  measured  by  RNA- 
Seq  in  T  and  B  cells  at  the  region  flanking  SWAP70 
locus. 

(C)  Relative  transcriptional  activities  of  GIMAP  and 
SWAP70  loci  in  B  and  T  cells  and  their  relation  to 
the  ERFS  fragility.  GIMAP  and  SWAP70  hot  spots 
are  shown  in  separate  facets.  The  x  axis  shows  the 
cell  lineage.  The  y  axis  upward  depicts  the 
loglO(RPKM)  in  B  and  T  cells  by  dark  and  light 
reds,  respectively;  the  y  axis  downward  depicts 
the  quantitation  of  aberrations  observed  by  FISH 
in  response  to  overnight  exposure  to  1  fiM  ATRi  in 
B  and  T  cells  in  dark  and  light  blue,  respectively. 

(D)  Relative  SWAP70  mRNA  abundance 
(measured  across  exon  4)  normalized  to  p-actin  in 
WT  and  SWAP70~/~  B  cells  (mean  ±  SD). 

(E)  Quantitation  of  aberrations  in  WT  and 
SWAP70~/-  cells  at  the  GIMAP  and  SWAP70 
regions  in  response  to  10  mM  HU.  See  also  Fig¬ 
ure  S5  and  Table  S3. 
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(Figure  3B).  Absence  of  CFS  expression  could  be  explained  by 
the  fact  that  high  concentrations  of  HU  stall  replication  forks  in 
early  S  phase  (Figure  1A),  whereas  CFSs  replicate  late  (Durkin 
and  Glover,  2007).  Conversely,  we  found  that  overnight  treat¬ 
ment  with  low  doses  of  aphidicolin  (0.2  pM  for  20  hr)  induced 
damage  at  the  CFSs  FRA1 4A2  and  FRA8E1 ,  whereas  the  ERFSs 
GIMAP  and  SWAP70  were  largely  insensitive  (Figures  3E  and 
3F).  These  data  are  consistent  with  the  idea  that  ERFS  arise 
from  fork  collapse  during  early  replication,  whereas  breakage 
at  CFSs  arises  from  a  failure  to  replicate  (Debatisse  et  al., 
2012),  and  the  two  forms  of  replication  stress  induce  distinct 
types  of  recurrent  DNA  lesions. 


ATR  Inhibition  Promotes  ERFS 
and  CFS  Expression 

The  ATR  kinase  protects  the  genome 
from  chromosomal  aberrations  at  late 
replicating  CFSs,  (Durkin  and  Glover, 

2007)  and  is  essential  for  stabilizing 
stalled  forks  and  facilitates  fork  restart  in 
early  S  phase  (Cimprich  and  Cortez, 

2008) .  To  confirm  that  ATR  inactivation 
induces  CFSs  and  determine  whether  it 
similarly  leads  to  damage  at  ERFSs,  we 
treated  asynchronous  B  cells  on  day  2 
with  1  pM  of  a  recently  described  ATR 
inhibitor  (ATRi)  (Toledo  et  al.,  2011).  We 
found  that  approximately  2.5%  and 
7.0%  of  the  total  chromosomal  aberra¬ 
tions  localized  to  the  two  CFSs,  FRA8E1 
and  FRA14A2,  respectively  (Figure  4A). 
ATR  deficiency  also  led  to  chromosomal 
aberrations  at  ERFSs  at  a  similar 

frequency  (Figure  4A;  Table  S3).  Moreover,  ERFSs  and  CFSs 
were  both  damaged  in  XRCC2~/~  cells  treated  with  ATRi  (Fig¬ 
ure  4A).  Thus,  the  rupture  of  unreplicated  regions  at  CFSs  and 
fork  collapse  at  ERFSs  are  similarly  sensitive  to  ATR  inhibition. 

Transcriptional  Activity  Can  Increase  ERFS  Fragility 

As  described  above,  ERFSs  are  enriched  in  regions  with  high 
transcriptional  activity  (Figures  1H,  2C,  and  S4F;  Table  SI).  To 
determine  the  contribution  of  transcriptional  activity  to  individual 
ERFSs,  we  focused  on  loci  with  tissue-specific  transcription 
patterns.  SWAP70  is  a  B-cell-specific  developmental  regulator, 
whereas  genes  within  the  GIMAP  cluster  are  expressed  both  in 
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B  and  in  T  cells  (Figures  4B  and  S5 A).  Treatment  with  ATRi  led  to 
a  similar  frequency  of  damage  at  GIMAP  in  B  and  T  cells,  consis¬ 
tent  with  insignificant  changes  in  gene  expression  between  the 
two  cell  types  (Figure  4C).  In  contrast,  damage  near  SWAP70 
was  3-fold  lower  in  T  than  in  B  cells  (Figure  4C;  Table  S3),  which 
correlated  with  the  decreased  transcription  of  SWAP70  in  T  cells 
(Figure  4B).  Nevertheless,  the  replication  timing  near  SWAP70 
was  similar  in  both  cell  types  (Figure  S5B).  To  further  delineate 
the  role  of  transcription  on  ERFS  breakage,  we  used 
SWAP70~/~  mice  in  which  2.7  kbp,  including  the  first  exon  and 
part  of  the  5'  untranslated  region,  is  removed  (Borggrefe  et  al., 
2001),  allowing  us  to  compare  the  fragility  of  ERFSs  in  the 
same  genomic  region  in  knockout  B  cells.  We  determined  that 
SWAP70  mRNA  in  SWAP70~/~  B  cells  was  reduced  by  approx¬ 
imately  4-fold  relative  to  levels  in  WT  (Figure  4D).  Moreover,  DNA 
damage  near  SWAP70  was  approximately  2.5-fold  lower  in 
SWAP70~/~  relative  to  levels  in  WT  B  cells  (Figure  4E).  In 
contrast,  DNA  damage  near  GIMAP  remained  at  a  similar  level 
both  in  WT  and  SWAP70~/~  cells  (Figure  4E).  Although  our 
data  indicate  that  high  level  of  transcription  contributes  to  the 
breakage  of  some  ERFSs,  other  molecular  features,  including 
repetitive  elements  (Figure  1 F),  covalently  bound  protein 
complexes,  and  RNA:DNA  hybrids,  might  also  be  sources  of 
ERFS  fragility. 

Oncogenic  Stress  Can  Trigger  ERFS  and  CFS  Fragility 

Oncogene  deregulation  is  thought  to  compromise  genome 
integrity  preferentially  at  CFSs  (Bartek  et  al.,  2007;  Halazonetis 
et  al.,  2008),  and  CFS  deletion  has  been  associated  with  various 
cancers  (Bignell  et  al.,  2010).  To  determine  whether  oncogenic 
stress  similarly  induces  DNA  damage  at  ERFSs,  we  overex¬ 
pressed  c-myc  in  B  cells  because  it  has  been  implicated  in  regu¬ 
lating  replication  initiation  and  origin  firing  (Dominguez-Sola 
et  al.,  2007).  XRCC2~'~  cells  were  utilized  to  increase  the  amount 
of  replicative  stress  and  DNA  damage  as  a  result  of  decreased 
HR  (Figure  S2D).  c-myc  overexpression  led  to  induction  of 
p53  (Figure  5A),  which  correlated  with  an  approximately  1.6- 
fold  increase  in  overall  DNA  damage  in  XFCC2_/_  cells  overex¬ 
pressing  c-myc  compared  to  empty  vector  (EV)-infected  cells 
(Table  S3).  Moreover,  7.3%  of  the  total  breaks  generated  in 
c-myc  overexpressing  cells  were  found  near  SWAP70, 
compared  to  2.4%  of  total  breaks  at  this  ERFS  in  EV-infected 
B  cells  (Figure  5B).  Similarly,  out  of  43  breaks  observed  in 
c-myc -infected  cells,  3  (7%)  were  found  at  the  GIMAP  cluster, 
and  3  (6.7%)  were  found  near  BACH2.  c-myc  overexpression 
also  induced  breaks  at  FRA8E1,  showing  a  2-fold  relative 
increase  in  breaks  relative  to  EV-infected  cells  (Figure  5B). 
Thus,  DNA  damage  induced  by  c-myc  overexpression  can  occur 
at  ERFSs  and  CFSs. 

ERFS  Fragility  Is  AID  Independent 

Mutations  and  DSBs  at  various  oncogenes,  including  c-myc,  are 
due  to  AID  off-target  activity  (Robbiani  et  al.,  2008).  Recently, 
a  number  of  genome-wide  studies  in  primary  B  cells  mapped 
AID-induced  DNA  translocation  events,  and  identified  several 
novel  hot  spots  for  AID-dependent  translocations  at  non-lg 
genes  (Chiarle  et  al.,  2011;  Kato  et  al.,  2012;  Klein  et  al.,  2011). 
Among  these  translocation  hot  spots,  MHCII,  GIMAP,  IKZF1, 


PVT1 ,  ETS1,  IRF4,  and  NFkBI  were  located  within  the  top  15 
ERFS  hot  spots  in  this  study,  whereas  the  IgFI  locus  (the  physio¬ 
logic  target  of  AID)  was  not  ranked  high  on  the  list  (Figure  2C; 
Table  SI).  To  determine  whether  AID  contributes  to  ERFS 
fragility,  we  stimulated  WT  and  AID  knockout  B  cells  with  LPS/ 
IL4  for  2  days,  and  then  treated  them  with  ATRi  overnight.  These 
conditions  induce  robust  AID-dependent  DNA  damage  simulta¬ 
neously  with  replication  stress.  We  probed  metaphases  with 
BACs  spanning  the  IgFI  locus,  the  GIMAP  cluster,  and  IKZF1- 
all  AID  translocation  hot  spots— as  well  as  BACFI2,  SWAP70, 
FOXP1 ,  and  BCL2  (Figure  S2E)— ERFSs  that  are  frequently  rear¬ 
ranged  in  B  cell  lymphoma  (Figure  2C;  Tables  SI  and  S2).  In  WT, 
the  IgH  locus  was  damaged  in  3.8%  of  cells,  but  the  frequency  of 
IgFI- specific  instability  did  not  increase  with  ATRi  (Figure  S2F), 
despite  the  fact  that  ATRi  greatly  increased  overall  damage 
(Table  S3).  Upon  ATRi  treatment,  the  frequency  of  breaks  at 
the  ERFSs  GIMAP,  IKZF1 ,  BACH2,  SWAP70,  and  FOXP1 ,  and 
BCL2  were  elevated  to  the  levels  similar  to  those  observed  at 
the  IgFI  in  activated  B  cells  (Figure  S2F).  Breaks  at  some  ERFSs 
were  even  spontaneously  detected  (FOXP1  and  GIMAP, 
Figure  S2F). 

To  determine  whether  AID  expression  contributes  to  aberra¬ 
tions  observed  at  ERFSs,  we  next  analyzed  their  breakage 
frequency  in  AID~'~  cells.  Unlike  WT  cells,  IgFI  breaks  were 
absent  in  AID~'~  cells.  In  contrast,  all  ERFSs  exhibited  similar 
levels  of  breakage  both  in  WT  and  A!D~'~  cells  (Figure  5C;  Table 
S3).  Therefore,  whereas  IgFI  breaks  in  B  cells  are  entirely  AID 
dependent,  the  breakage  of  ERFSs  is  AID  independent.  Alto¬ 
gether,  these  data  suggest  that  some  recurrent  rearrangements 
in  B  cell  lymphoma  are  due  to  AID-independent  replicative  stress 
at  ERFSs. 

Genome  Instability  at  ERFSs  Is  Observed  in  Mouse 
Models  and  Human  Cancer 

Among  the  top  1 5  ERFS  hot  spot  that  break  in  response  to  AID- 
independent  replication  stress,  we  have  identified  three  partners 
that  recurrently  translocate  to  IgFI  in  lymphomas:  BACFI2, 
FOXP1,  and  BCL2  (Table  S2).  We  hypothesized  that  if  AID- 
dependent  DSBs  in  G1  persisted  into  early  S  phase,  transloca¬ 
tions  between  AID-dependent  breaks  and  ERFS  might  be 
detectable.  To  test  this,  we  examined  cells  transgenically  over¬ 
expressing  AID  and  simultaneously  deficient  for  53BP1 
(lgicAID/53BP7_/_),  thus  allowing  the  persistence  of  G1  IgFI 
breaks  into  S  phase  where  they  could  be  joined  to  ERFSs. 
Indeed,  26%  and  7%  of  lgicAID/53PP7_/“  B  cells  carried  IgFI 
locus  and  BACFI2  breaks,  respectively  (Figure  5D).  These  breaks 
are  fusogenic  because  IgFI-  and  BAC/-/2-associated  transloca¬ 
tions  to  unidentified  partner  chromosomes  were  found  in  7.3% 
and  1.2%  of  the  metaphases,  respectively  (Figure  5D).  Impor¬ 
tantly,  we  also  detected  one  lgFI/BACFI2  translocation  among 
750  cells  (Figure  5E),  reminiscent  of  the  lgH/BACH2  transloca¬ 
tions  observed  in  human  B  cell  lymphoma  (Kobayashi  et  al., 
2011).  Thus,  AID-dependent  breaks  generated  in  G1  (Petersen 
et  al.,  2001)  can  join  to  ERFS  breaks  triggered  in  early  S  phase. 

A  hallmark  of  cancer  genomes  is  widespread  copy-number 
changes,  insertions,  and  deletions.  To  determine  whether  dele¬ 
tions  and/or  amplifications  at  ERFSs  are  a  general  feature  of 
the  B  cell  lymphoma  genome,  we  compared  our  ERFSs  with 
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Figure  5.  ERFS  Fragility  Is  Observed  in 
Response  to  Oncogenic  Stress  and  in 
Human  Cancer 

(A)  Western  blot  for  phosphorylated  p53  in  c-myc 
and  EV-infected  XPCC2~'~  B  cells. 

(B)  Aberrations  in  c-myc- infected  and  EV-infected 
XRCC2~/  B  cells. 

(C)  Aberrations  in  WT  (blue  bars)  and  AID~'~  B 
cells  (red  bars)  treated  with  1  pM  ATRi. 

(D)  Spontaneous  chromosome  breaks  (blue  bars) 
and  translocations  (red  bars)  at  the  IgH  and 
BACH2  locus  in  lgi<AID/53BP7^/_  B  cells. 

(E)  Normal  chromosomes  and  a  translocation  of 
BACH2  ERFS  (red)  to  the  IgH  locus  (green)  is 
shown. 

(F  and  G)  ERFSs  significantly  overlap  with  MCRs 
detected  in  DLBCL.  The  Venn  diagram  shows  the 
overlap  of  ERFSs  with  MCR  found  in  DLBCL.  The 
total  number  of  regions  is  indicated  for  each 
shared  and  unique  area  and  color-coded  based  on 
the  region’s  title. 

(G)  Significance  of  correlation  between  the  ERFSs 
and  MCRs  is  evaluated  relative  to  the  permutation 
model  and  CFSs.  The  percent  increase  in  the 
overlap  between  the  ERFSs  and  MCRs  relative  to 
the  permutation  model’s  expectation  (mean  ± 
SEM,  p<  1  x  1CT4)  and  CFSs  are  shown  in  the  left 
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high  resolution  copy-number  changes  detected  in  biopsies 
of  patients  with  diffuse  large  B  cell  lymphoma  (DLBCL),  the 
most  common  type  of  non-Hodgkins  lymphoma  (Lenz  et  al., 
2008).  A  total  of  190  “minimal  common  regions”  (MCRs)  were 
found  among  203  biopsies,  carrying  a  gain  or  a  loss  of  a  chromo- 


BCL2  and  right  bar  graphs,  respectively. 

(H)  ERFSs  are  enriched  for  known  cancer  genes. 
The  pie  chart  shows  the  fraction  of  putative  cancer 
genes  (Bignell  et  al.,  2010)  associated  with  ERFSs 
(p  <  6  x  10-20).  See  also  Figure  S6  and  Tables  S3 
and  S4. 
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somal  region  ranging  in  size  from  5  kbp 
to  21Mbp  (Lenz  et  al.,  2008).  Mouse 
ERFS  coordinates  were  overlaid  onto 
the  human  genome  using  two  methods, 
yielding  2,205  syntenic  regions  (Figures 
S6B-S6D).  Notably,  51 .6%  of  the  MCRs 
observed  in  primary  DLBCL  overlapped 
with  syntenic  ERFS  regions  (p(permuta- 
tion)  <  1  x  10-4,  Figure  5F).  Moreover, 
20.4%  of  ERFSs  overlapped  with  MCRs, 
32%  higher  than  expectation  (p(permuta- 
tion)  <  1  x  10-6,  Figure  5G).  Surprisingly, 
ERFS  were  deleted  or  amplified  in 
DLBCL  at  least  81%  more  frequently 
as  compared  to  CFSs,  despite  their 
cancer-specific  propensity  for  breakage 
(Figure  5G).  Moreover,  our  analysis  indi¬ 
cated  that  the  DLBCL  copy-number  alter¬ 
ations  exhibited  2-fold  higher  correlation 
with  B  cell  ERFSs  compared  to  deletions 
and/or  amplifications  in  T  lineage  acute 
lymphoblastic  leukemia  (Figure  S6A)  (Zhang  et  al.,  2012a). 
Finally,  by  examining  homozygous  deletions  in  cancer  genomes 
(Bignell  et  al.,  2010),  we  found  that  25  out  of  64  genes  known  to 
contribute  to  oncogenesis  coincide  with  ERFSs  (p(hypergeomet- 
ric)  <  6  x  IQ"20,  Figure  5H;  Table  S4).  Based  on  these  findings, 


ERFS-associated 
(25,  39.1%) 


classification  of 
putative  cancer  genes 


628  Cell  752,  620-632,  January  31,  2013  ©2013  Elsevier  Inc. 


Cell 


G1  phase 


S  phase 


.0. 


AID  — ^  I  J  AID  target 

I  (IgH,  c-Myc) 


AID-dependent  AID-independent 

damage  damage 


S  phase 


Replication 

Replication 

stress 

stress 

=><^ 

ERFS 

oncogene 
(Bach2,  Bcl-2) 

— 

— 

ERFS 
oncogene 
(Bach2,  Bcl-2) 


AID-independent 

damage 


subsequent 
G1  phase 


cell  divides  with 
ERFS  damage 


I 


AID  target 
(IgH,  c-Myc) 


AID-dependent 

damage 


Figure  6.  Model  for  Recurrent  Rearrange¬ 
ments  in  B  Cell  Lymphomas 

AID  is  active  in  G1  (Petersen  et  al.,  2001)  and 
targets  IgH  and  various  oncogenes  (e.g.,  c-myc). 
Replication  fork  collapse  at  ERFSs  in  S  phase 
occurs  at  preferential  sites  including  various 
cancer-associated  genes  (e.g.,  BCL2,  BACH2).  An 
AID-generated  break  might  be  passed  from  G1  to 
early  S,  where  it  meets  an  ERFS,  which  may 
eventually  result  in  a  translocation  (left).  Alterna¬ 
tively,  an  ERFS  (bearing  unresolved  a  replication 
intermediate  of  under-replicated  DNA)  might  break 
in  mitosis  and  then  become  permissive  to  trans¬ 
locate  to  an  AID-induced  DSBinthenextGI  phase 
of  the  cell  cycle  (right). 


we  conclude  that  ERFSs  are  a  significant  feature  of  the  muta¬ 
tional  landscape  of  diffuse  large  B  cell  lymphomas  and  poten¬ 
tially  other  cancers. 

DISCUSSION 

Replicative  Stress  at  ERFSs  Contributes  to  Genome 
Instability  in  B  Cells 

Although  AID  has  been  implicated  in  B  cell  translocations 
(Gostissa  et  al.,  2011),  very  little  is  known  about  the  mechanisms 
of  chromosomal  breakage  at  several  IgH- partner  loci,  including 
BCL2,  BACH2 ,  and  FOXP1 .  Besides  programmed  DNA  damage, 
replication-based  mechanisms  are  a  major  contributor  to  chro¬ 
mosomal  instability  in  cancer  (Liu  et  al.,  2012).  Activated  B  cells 
are  among  the  most  rapidly  dividing  mammalian  cells  (Zhang 
et  al.,  1 988),  which  potentially  exposes  them  to  high  endogenous 
levels  of  replicative  stress.  Here,  we  have  used  a  genome-wide 
approach  to  identify  a  subset  of  early  replicating  regions  in  the 
B  cell  genome  that  are  particularly  vulnerable  to  fork  collapse 
and  contribute  to  rearrangements  in  B  cell  malignancies.  In  our 
model,  ERFS  breaks  can  occur  after  the  generation  of  unre¬ 
paired  AID-induced  breaks  in  G1,  and  the  two  breaks  could 
recombine  during  S  or  G2.  Alternatively,  ERFS  damage  might 
persist  through  mitosis  resulting  in  DNA  breaks  in  the  subse¬ 
quent  G1  phase  when  AID  is  predominantly  active.  In  either 
case,  we  suggest  that  AID-mediated  DSBs  in  G1  (Petersen 
et  al.,  2001),  together  with  replication-stress-induced  damage 
at  recurrent  loci,  can  coordinately  drive  B  cell  lymphoma  initia¬ 
tion  and  progression  (Figure  6). 


ERFS  versus  CFS 

CFSs  are  considered  to  be  the  most 
replication-stress-sensitive  sites  in  the 
genome  (Durkin  and  Glover,  2007). 
Although  no  single  mechanism  accounts 
for  CFS  instability,  it  is  hypothesized  that 
a  number  of  different  characteristics 
may  contribute  to  their  fragility  including 
co-occurrence  with  very  large  genes, 
late  replication,  low  density  of  replication 
origins,  high  A-T  content,  and  sequences 
prone  to  form  secondary  structures, 
histone  hypoacetylation,  and  a  con¬ 
densed  chromatin  structure  (Helmrich  et  al.,  2011;  Jiang  et  al., 
2009;  Letessier  et  al.,  2011;  Ozeri-Galai  et  al.,  2011).  In  stark 
contrast  to  CFSs,  our  identified  ERFSs  replicate  early;  have 
an  open  chromatin  configuration;  and  are  origin-,  gene-,  and 
G-C-rich. 

Despite  these  diametrically  opposite  properties,  both  CFS 
and  ERFS  fragility  are  increased  by  ATR  inhibition  (Figure  4A), 
oncogenic  stress  (Figure  5B),  and  deficiencies  in  HR  (Figure  3C) 
(Bartek  et  al.,  2007;  Durkin  and  Glover,  2007;  Halazonetis  et  al., 
2008).  These  conditions  decrease  the  rate  of  fork  progression 
but  concomitantly  increase  the  density  of  replication  initiating 
events  (Bester  et  al.,  2011;  Daboussi  et  al.,  2008;  Dominguez- 
Sola  et  al.,  2007;  Shechter  et  al.,  2004),  which  might  contribute 
to  the  damage  at  both  CFSs  and  ERFSs,  respectively.  The 
decrease  in  fork  speed  hinders  the  completion  of  replication 
at  CFSs,  either  because  of  the  scarcity  of  origins  near  CFSs 
(Letessier  et  al.,  2011),  the  heterochromatic  nature  of  the  regions 
that  would  limit  accessibility  of  DNA  replication  and/or  DSB 
repair  machineries  (Jiang  et  al.,  2009),  or  because  of  the  interfer¬ 
ence  between  transcription  and  replication  at  very  large  genes 
(Helmrich  et  al.,  2011).  Although  additional  origins  are  not 
activated  near  CFSs  upon  replication  stress  (Letessier  et  al., 
2011),  an  increase  in  origin  activity  at  early  replicons  might 
paradoxically  contribute  to  genome  instability  at  ERFSs.  For 
example,  increasing  the  replication  initiation  events  near  highly 
transcribed  gene  clusters  with  divergent  and/or  convergent 
gene  pairs  could  increase  conflicts  between  DNA  replication 
and  transcription  machineries.  The  higher  density  of  activated 
origins  at  ERFSs  would  also  be  expected  to  prematurely  deplete 
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nucleotide  pools  (Bester  et  al.,  2011),  thereby  increasing  the 
probability  of  subsequent  fork  stalling  and  collapse.  These  two 
outcomes  of  replication  stress  are  likely  to  be  linked  because 
increased  replication  initiation  and  depletion  of  nucleotide 
supplies  slows  replication  (Bester  et  al.,  2011;  Jones  et  al., 
2012),  whereas  slow  fork  progression  causes  activation  of 
dormant  origins  (Ge  et  al.,  2007),  and  both  incomplete  replication 
and  increased  origin  firing  are  monitored  by  ATR  activity 
(Shechter  et  al.,  2004).  In  conclusion,  increased  initiating  events 
at  ERFSs  and  a  paucity  of  replication  initiation  at  CFSs  could 
both  challenge  replication  fidelity. 

ERFSs  and  Cancer 

Oncogenic  stress  is  a  major  driving  force  in  the  early  stages  of 
cancer  development  (Halazonetis  et  al.,  2008);  nevertheless, 
the  factors  that  trigger  replicative  stress  in  vivo  remain  unclear. 
In  the  case  of  B  cell  lymphomas,  oncogenic  stress  can  be  initi¬ 
ated  by  the  activity  of  AID,  which  by  targeting  non-/g  genes 
such  as  c-myc  (Robbiani  et  al.,  2008),  leads  to  c-myc/IgH  trans¬ 
locations  and  consequent  aberrant  c-myc  expression.  This  form 
of  AID-induced  oncogenic  stress  or  high  levels  of  proliferative 
activity  in  activated  B  cells  could  generate  DNA  damage  at 
ERFSs  (Figure  6). 

Altogether,  1 03  AID  hot  spots  (Chiarle  et  al.,  201 1 ;  Klein  et  al., 
201 1)  — including  the  GIMAP  cluster,  MHCII  locus,  and  IKZF1  — 
were  also  identified  as  ERFS  hot  spots  in  this  study  (Table  SI). 
It  is  possible  that  the  overlap  observed  between  a  subset  of 
off-target  AID  sites  and  ERFSs  is  due  to  common  underlying 
features  of  these  loci.  For  example,  AID  is  recruited  to  ssDNA 
regions  (Chaudhuri  and  Alt,  2004),  which  are  also  generated 
during  replicative  stress;  AID-dependent  DSBs  and  ERFSs  are 
also  both  enriched  in  repeat  elements  (Staszewski  et  al.,  2011). 
In  addition,  chromosomal  regions  with  the  highest  transcriptional 
activity  have  the  highest  AID-dependent  translocation  density 
(Chiarle  et  al.,  2011;  Klein  et  al.,  2011),  and  early  origins  and 
translocations  frequently  reside  near  transcription  start  sites 
and  RNA  polymerase-ll-binding  sites  (Chiarle  et  al.,  2011;  Klein 
et  al.,  2011).  Thus,  these  euchromatic  regions  could  serve  both 
as  AID  targets  in  G1  and  also  be  susceptible  to  fork  collapse 
during  early  S  phase. 

A  number  of  different  hypotheses  have  been  put  forward 
about  the  mechanisms  that  promote  recurrent  translocations 
in  mature  B  cell  lymphomas.  These  include  recurrent  genomic 
damage  by  AID,  random  DNA  damage  followed  by  selection, 
and  a  nonrandom  3D  organization  of  the  genome  (Chiarle 
et  al.,  2011;  Flakim  et  al.,  2012;  Klein  et  al.,  2011;  Zhang 
et  al.,  2012b).  To  date,  replication-stress-induced  DNA  damage 
has  been  associated  with  late-replicating  CFS.  By  using  an 
alternative  experimental  approach  for  the  discovery  of  fragile 
site  expression  during  early  replication,  we  have  identified 
a  novel  source  of  recurrent  AID-independent  DNA  breaks 
that  may  play  a  mechanistic  role  in  some  of  the  most  common 
genome  rearrangements  during  B  cell  lymphomagenesis. 
Because  transcriptional  activity  and  replication  timing  of  a 
genomic  region  vary  among  different  cell  lineages  (Hansen 
et  al.,  2010),  different  sets  of  ERFSs  might  also  account  for 
recurrent  chromosomal  rearrangements  in  cancers  of  distinct 
cellular  origins. 


EXPERIMENTAL  PROCEDURES 
Mice 

XRCC2~/  { Frappart  et  al.,  2009),  53BP1^/_  (Ward  et  al.,  2004),  IgicAID 
(Robbiani  et  al.,  2009),  AID^'~  (Muramatsu  et  al.,  2000),  and  SWAP70~/~ 
(Borggrefe  et  al.,  2001)  mice  have  been  described.  SWAP70~I~  and  WT 
control  mice  used  in  Figure  4D  are  C57BL/6  background;  all  other  mice  are 
129/Sv  x  C57BL/6  background. 

ChIP-Seq,  Repli-Seq,  RNA-Seq,  DHS  I  Mapping,  and  FISH  Analysis 

ChIP-seq  and  RNA-Seq  procedures  were  performed  as  in  Yamane  et  al. 
(2011),  Repli-Seq  was  performed  as  described  in  Hansen  et  al.  (2010).  DHSI 
mapping  was  performed  as  described  (Sekimata  et  al.,  2009),  and  fluores¬ 
cence  in  situ  hybridization  (FISH)  analysis  is  described  in  Callen  et  al.  (2007). 
For  detailed  methods,  see  Extended  Experimental  Procedures. 

BACs 

Individual  BACs  to  ERFSs  were  identified  using  NCBI  clone  finder  and 
purchased  from  BACPAC.  For  complete  list  of  BAC  probes  used  in  FISH 
experiments  see  Extended  Experimental  Procedures. 

Retroviral  Infection 

Cells  were  infected  with  pMX-c-Myc-IRES-GFP  or  empty  vector  and  GFP- 
positive  cells  were  sorted  as  described  (Robbiani  et  al.,  2008). 

Statistical  and  Computational  Analyses 

Detailed  description  is  available  in  Extended  Experimental  Procedures. 

ACCESSION  NUMBERS 

The  ChIP-seq  and  RNA-seq  data  are  deposited  in  GEO  under  accession 
number  GSE43504. 

SUPPLEMENTAL  INFORMATION 

Supplemental  Information  includes  Extended  Experimental  Procedures,  six 
figures,  and  four  tables  and  can  be  found  with  this  article  online  at  http://dx. 
doi.org/1 0.101 6/j.cell.201 3.01 .006. 
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ABSTRACT 

Germline  mutations  in  PALB2  lead  to  a  familial  predisposition  to  breast  and 
pancreatic  cancer  or  to  Fanconi  Anemia  subtype  N.  PALB2  performs  its  tumor 


suppressor  role,  at  least  in  part,  by  supporting  homologous  recombination  through 
interactions  with  BRCA1,  BRCA2  and  RAD51.  To  further  understand  the  biology  of 
PALB2-mediated  tumor  suppression,  we  targeted  Palb2  in  the  mouse.  Palb2- deficient 
murine  cells  recapitulated  DNA  damage  and  checkpoint  defects  caused  by  PALB2 
depletion  in  human  cells.  Similar  to  what  happens  after  Brcal  and  Brca2  ablation, 
germline  Palb2  deficiency  in  mice  led  to  embryonic  lethality,  which  could  be  delayed  by 
ablation  of  checkpoint  regulator  genes  such  as  Cdknla  (p21)  and  Cdkn2a 
(pl6INK4A/pl9ARF).  In  contrast,  somatic  deletion  of  Palb2  driven  by  K14-Cre  led  to  tumor 
formation  in  the  mammary  glands,  skin,  and  oral  cavity  with  long  latency.  Mutation  or 
loss  of  heterozygosity  (LOH)  of  Trp53  was  frequently  observed  in  these  tumors.  Palb2 
loss  synergized  with  somatic  Trp53  loss  to  markedly  accelerate  tumor  formation. 
Comparative  genomic  hybridization  (CGH)  analysis  of  Palb2/Trp53  compound  mutant 
breast  tumors  revealed  distinct  chromosomal  aberrations  that  only  partially  overlapped 
with  Brcal/Trp53  and  Brca2/Trp53  compound  mutant  tumors,  implying  that  the  Palb2 
tumor  suppression  process  is,  in  part,  distinct  from  that  executed  by  Brcal  and  Brca2. 
These  observations  underscore  the  usefulness  of  this  new  mouse  model  to  investigate  the 
molecular  mechanisms  underlying  the  breast  tumor  suppression  functions  of  Palb2. 


INTRODUCTION 


PALB2  (Partner  and  Localizer  of  BRCA2)  was  recently  identified  as  a  major 
interacting  protein  of  the  breast  cancer  susceptibility  gene  product  BRCA21.  This 
interaction  is  required  for  the  repair  of  DNA  double  strand  breaks  by  homologous 
recombination  (HR),  since  PALB2  is  necessary  for  the  chromatin  association  of  BRCA2 
and  its  partner,  RAD511.  RAD51  is  the  central  recombinase  in  HR,  involved  in  D-loop 
formation  and  strand  displacement  (PMID:  12778123).  PALB2  also  plays  a  BRCA2- 
independent  role  in  the  HR  process  by  enhancing  RAD51  function2, (PMID  20871615)_ 

PALB2  was  also  found  to  interact  with  BRCA1  and,  as  such,  supports  the  long 
known  interaction  between  BRCA1  and  BRCA23’4.  Loss  of  PALB2  does  not  affect 


BRCA1  recruitment  to  irradiation-induced  foci  (IRIF),  but  abrogates  localization  of 


BRCA2  and  RAD5 1  at  these  structures.  Genetic  analyses  have  shown  that,  like  BRCA2, 
a  member  of  Fanconi  anemia  complementation  group  Dl,  PALB2  is  also  a  Fanconi 
anemia  protein  (FANCN)5’ 6  and  a  breast  tumor  suppressor  in  its  own  right7'10.  Unlike  the 
case  of  BRCA1  and  BRCA2 -mutant  tumors,  only  some  PALB  2 -associated  breast  cancers 
have  undergone  loss  of  PALB2  heterozygosity  (LOH)  ’  .  This  implies  that  a  reduction  of 
PALB2  gene  copy  number  by  half  is  sufficient  to  allow  breast  cancer  development  in 
some,  but  not  all  settings.  Why  this  difference  exists  is  an  open  question,  and  an 
experimental  to  address  this  question,  among  others,  is  in  order. 

Breast  cancer  in  PALB2- mutated  families  appears  to  be  of  intermediate 

O  1  A 

penetrance,  unlike  that  in  BRCA1/2  families  ’  .  Although  PALB2  mutations  are  rarer 
than  BRCA1/2  mutations,  available  clinical  data  indicate  that  heterozygous,  germline 
PALB2  mutations  do  not  phenocopy  precisely  either  BRCA1  or  BRCA2  cancer 
predisposition  syndromes  ’  ,  consistent  with  the  notion  that  PALB2  biological  functions 
extend  beyond  solely  bridging  BRCA1-BRCA2  complex  formation.  Moreover,  PALB2 
also  interacts  with  MRG15  (also  known  as  MORF4L1)11  a  subunit  of  HAT/HDAC 
complexes  and  with  KEAP1,  a  major  regulator  of  the  antioxidant  transcription  factor 
NRF2  (also  known  as  NFE2L2)  .  In  addition,  a  highly  conserved  chromatin-associated 
domain  (ChAM)  has  been  described  in  a  region  of  PALB2  for  which  there  are  no  known 
binding  partners  .  The  contribution  of  these  PALB2  binding  partners  and  of  the  ChAM 
domain  to  the  BRCA1-PALB2-BRCA2  HR  machinery  and/or  to  hitherto  unknown 
functions  of  PALB2,  unrelated  to  homologous  recombination  is  so  far  unclear. 

In  an  effort  to  study  how  PALB2  contributes  to  breast  cancer  suppression,  we 
generated  the  first  model  of  high  penetrance  Palb2  breast  cancer  in  the  mouse  and 
assessed  a  number  of  its  most  salient  properties.  These  insights  are  reported  below. 


RESULTS  AND  DISCUSSION 


Targeting  the  mouse  Palb2  gene  and  the  generation  of  Pa/62-deficient  ES  cells 


In  order  to  generate  a  Palb2  allele  that  could  be  conditionally  inactivated  upon 
Cre  recombinase  expression,  we  inserted  loxP  sites  flanking  exons  2  and  3  of  the  Palb2 
gene  (Fig.  SIB).  These  exons  encode  a  putative  nuclear  localization  signal  (NLS) 
sequence  and  the  PALB2  coiled-coil  domain  (Fig.  1  A).  The  latter  was  previously  shown 
to  mediate  the  PALB2  interaction  with  BRCA13'4.  Deletion  of  these  exons  resulted  in 
out-of-frame  reading  of  exon  4,  leading  to  premature  termination  of  the  protein  and 
exclusion  of  the  BRCA2-interacting  seven-bladed  WD40-type  (3-propeller  domain 
encoded  by  downstream  exons.  Due  to  the  premature  truncation  of  the  Palb2  open 
reading  frame  (ORF),  the  resulting  transcript  is  also  a  candidate  target  for  degradation  by 
the  nonsense-mediated  decay  (NMD)  pathway. 

Targeting  of  the  Palb2  locus  and  integration  of  both  loxP  sites  was  confirmed  by 
Southern  blot  analysis  (Fig.  SIC).  Heterozygous  ES  cells  (Palb2neo/+)  were  injected  into 
blastocysts  and  the  resulting  chimeras  from  2  individual  clones  were  bred  to  either  Flp- 
deleter  mice  (to  eliminate  the  Frt- flanked  Ned  cassette)  or  Cre- deleter  mice  (to  generate 
a  conventional  Palb2  knockout  allele).  Germline  transmission  occurred  from  nearly  all 
chimeras,  and  mice  were  PCR-genotyped  for  the  Flp-  or  Cre -recombined  alleles  ( Palb2fJ 
or  Palb2~,  respectively;  Fig  SIB). 

Palb2~/~  ES  cells  were  derived  from  blastocysts  of  heterozygous  Palb2l!/~  crosses 
according  to  standard  protocols.  Three  independent  ES  cell  lines  (1.1,  1.3  and  1.4)  were 
generated  from  one  such  timed  mating.  Expression  analysis  of  Palb2  mRNA  by 
quantitative  real-time  RT-PCR  (qRT-PCR)  confirmed  that  one  of  them  was  Palb2IJ/l1  and 
the  other  two  were  Palb2~/~  (Fig.  SID),  demonstrating  that  Palb2  loss  does  not  prevent  ES 
cell  derivation  and  subsequent  survival.  The  three  ES  cell  lines  that  were  derived  were 
morphologically  comparable,  proliferated  at  a  similar  rate  to  wild-type  (WT)  ES  cells  and 
could  undergo  differentiation  into  embryoid  bodies  (data  not  shown).  The  loss  of  full 
length  Palb2  expression  in  these  lines  was  confirmed  by  qRT-PCR  for  Palb2  mRNA 
(Fig.  SID)  and  by  Western  blotting,  using  an  affinity-purified  rabbit  polyclonal  anti¬ 
mouse  PALB2  antibody  (raised  against  the  first  200  residues  of  the  mouse  PALB2 
protein)  (Fig.  IB). 


ES  cells  that  are  deficient  for  Brcal  or  Brca2  have  been  notoriously  difficult  to 
isolate  and  are  severely  compromised  in  their  proliferation  (PM  ID:  8698242,  PM  ID: 
9126738).  Since  PALB2  acts  immediately  upstream  of  BRCA2  and  is  required  for 
BRCA2  localization  at  double  strand  breaks  (DSBs),  it  is  possible  that  the  viability  and 
robustness  of  the  Palb2~/~  cells  was  due  to  residual  expression  of  a  truncated  Palb2 
species  resulting  from  a  downstream  translation  initiation  that  we  could  not  detect  with 
our  antibody.  In  keeping  with  this  possibility,  Palb2~f~  ES  cells  could  not  be  derived  from 
embryos  carrying  a  conventional  PALB2  gene -trap  allele14. 

To  test  whether  the  conditional  gene  targeting  approach  that  was  employed  had 
generated  a  functionally  null  allele,  the  response  of  Palb2^1  and  Palb2~/~  ES  cells  to  DNA 
damaging  agents  that  cause  double  strand  breaks  was  analyzed.  Normally,  exposure  of 
PALB2- proficient  cells  to  ionizing  radiation  (IR)  leads  to  the  formation  of  yH2AX 
nuclear  foci  and  subsequent  recruitment  to  these  structures  of  BRCA1,  BRCA2  and 
RAD51  to  these  structures.  As  expected,  yH2AX  and  BRCA1  foci  formation  was 
unaffected  in  Palb2~/~  cells  (Fig.  1C,  Fig.  S2A).  However,  the  recruitment  of  RAD51  to 
IRIF  was  severely  compromised,  and  no  RAD5 1  colocalization  with  yH2AX  foci  could 
be  detected  after  exposure  of  cells  to  IR  (Fig.  1C).  This  defect  was  also  evident  at  the 
biochemical  level,  since  no  increase  in  chromatin  loading  of  RAD51  after  IR  was 
detected  in  Palb2~/~  cells  (Fig.  ID).  Since  biallelic  PAFB2  mutations  in  humans  cause 
Fanconi  anemia  (FA),  a  hallmark  of  which  is  increased  sensitivity  to  DNA  cross-linking 
agents  such  as  Mitomycin  C  (MMC),  the  sensitivity  of  Palb2~/~  ES  cells  to  MMC  as  well 
other  DNA  damaging  agents  was  assayed.  Both  Palb2- deficient  lines  displayed  increased 
sensitivity  to  MMC  and  IR  (Fig.  IE,  Fig.  S2B-C).  These  findings  reinforce  the  notion  that 
these  Palb2~/~  cells  are  functional  knock-outs  for  Palb2,  as  they  appear  compromised  in 
all  known  Palb2- associated  functions.  Thus,  upon  Cre-mediated  recombination  in  vivo, 
the  aforementioned  conditional  Palb2  allele  should  be  converted  to  a  Palb2- null  allele. 


Loss  of  Palb2  in  the  germline  results  in  early  embryonic  lethality 

Germline  deletion  of  Brcal  or  Brca2  results  in  early  embryonic  lethality15’17 
Although  Palb2~/~  ES  cells  displayed  no  significant  growth  defects  in  the  absence  of 


exogenous  DNA  damage  when  compared  with  Palb^f,fl  controls,  Palb2  loss  could  be 
deleterious  in  differentiated  progeny  cells,  and  thereby  negatively  affect  mouse 
development.  Indeed,  we  were  unable  to  obtain  Palb2~/~  mice  from  heterozygous  crosses 
either  at  the  time  of  weaning  or  at  birth,  consistent  to  previous  reports14, 18.  Dissection  of 
embryos  from  timed  pregnancies  revealed  that  Palb2- null  embryos  could  only  be 
recovered  up  to  El  1.5,  but  only  at  sub-Mendelian  ratios,  and  always  presenting  severe 
malformations.  At  earlier  time  points,  morphological  aberrations  of  Palb2~i~  embryos 
were  less  obvious,  although  these  mutant  embryos  were  evidently  smaller  than  WT  or 
heterozygous  littermates  (Fig.  2 A),  some  displaying  exencephaly  as  well  as 
malformations  of  the  labyrinth  and  yolk  sac-associated  hematopoietic  blood  islets  (Fig. 
2C-F).  The  finding  that  Palb2  deficiency  results  in  embryonic  lethality  detectable  at 
E8.5-E10.5  is  largely  consistent  with  earlier  reports  that  homozygous  Palb2- deficient 
mice,  generated  using  a  randomly-inserted  gene -trap  vector,  also  die  during 
embryogenesis  ~E8.514, 18. 

Embryonic  lethality  due  to  loss  of  Brcal  or  Brca2  can  be  delayed  by  concomitant 
loss  of  P53  and/or  p2119,20.  Likewise,  p53  loss  delayed  the  lethality  of  Palb2  KO 
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embryos  ,  which  also  display  increased  levels  of  p21  .  We  therefore  tested  whether  p21 
deficiency  would  also  have  an  effect  on  the  embryonic  lethality  associated  with  Palb2 
nullizygosity.  As  expected,  p2 1  deficiency  did  delay  embryonic  lethality  of  Palb2  KO 
embryos  by  2-3  days  (Fig.  2B).  A  similar  effect  was  observed  when  Palb2- deficient 
embryos  were  bred  to  a pl6INK4A- null  allele  (Fig.  2B);  however,  all  Palb2/p21  or 
Palb2/pl6INK4A  double  knockout  embryos  still  displayed  multiple  malformations  and 
impaired  growth  as  compared  with  Palb2  heterozygous  or  WT  littermates  (not  shown), 
and  were  eventually  resorbed. 

Because  the  establishment  of  the  placenta  and  onset  of  embryonic  hematopoiesis 
are  critical  developmental  steps  that  take  place  around  the  time  of  lethality  of  Palb2 
embryos,  we  set  out  to  verify  whether  the  lethality  of  Palb2  KO  embryos  could  be  by¬ 
passed  by  a  WT  placenta.  To  this  end,  we  used  the  Palb2/l/fl  and  Pa lb2~'~  ES  cells  to 
perform  tetraploid  complementation  assays.  We  found  that  embryos  derived  from  the 
Palb2~l~  ES  cells  were  underdeveloped  and  malformed  already  at  E9.5  when  compared  to 
their  Palb2^/fl  counterparts  (Fig.  21- J).  At  E12.5,  embryos  derived  from  Palb2fI/tl  ES 


looked  normal,  whereas  embryos  from  Palb2~'~  ES  cells  were  completely  degenerated  and 
resorbed  (Fig.  S3).  Likewise,  breeding  of  the  Palb2  conditional  allele  to  Meox2-Cre  KI 
mice  (in  which  Cre  is  expressed  from  the  endogenous  Meox2  locus  only  in  the  embryo 
proper  )  only  yielded  mice  in  which  Palb2  deletion  was  incomplete  (data  not  shown). 
Collectively,  these  findings  indicate  that  Palb2  is  a  developmental^  essential  gene,  and 
its  deficiency  leads  to  multiple  defects.  These  findings  are  also  in  accord  with  previous 
studies  in  which  the  lethality  of  Brea  P1'  embryos  was  not  rescued  by  tetraploid 
complementation  assay15. 


Palb2  is  a  breast  tumor  suppressor  in  mice 

In  order  to  assess  the  effect  of  Palb2  loss-of- function  in  mammary  tumorigenesis, 
we  crossed  Palb2P/fl mice  with  K14-Cre  transgenic  mice22.  K14-Cre  transgenic  mice 
preferentially  express  Cre  recombinase  in  the  basal  epithelium  of  the  mammary  ducts,  as 
well  as  in  other  epithelial  compartments  such  as  skin  and  oral  mucosa.  K14-Cre  has 
previously  been  used  to  model  Brcal  and  Brca2  loss-mediated  mammary  tumorigenesis 
in  the  mouse23’25. 

In  humans  and  mice,  mammary  tumor  formation  initiated  by  BRCA1  or  BRCA2 
loss  requires  concomitant  loss  of  functional  P53  (encoded  by  Trp53  in  mice)  ’  . 
Therefore,  we  set  out  to  generate  cohorts  of  Palb2/Trp53  double  conditional  mice  by 
crossing  Palb2fl/^;K  1 4-Cre  transgenic  mice  with  7>p53-floxed  mice  in  order  to  assess  the 
contribution  of  each  gene  alone  and  in  combination  to  suppressing  mammary  tumor 
formation  .  All  mice  that  harbored  the  Kl 4-Cre  transgene  or  conditional  alleles  for 
Palb2  and/or  Trp53  were  phenotypically  normal,  fertile,  and  capable  of  nursing  their 
litters. 

During  the  period  of  tumor  monitoring  (up  to  600  days  after  birth),  Trp53fl/fl\K14- 
Cre  female  mice  developed  spontaneous  tumors  with  a  frequency  of  -80%  and  a  mean 
tumor-free  interval  (T50)  of  320  days.  In  contrast,  Palb2^1\  Trp53^l;K  1 4-Cre  double 
conditional  mice  displayed  significantly  decreased  latency  in  tumor  formation  (T50  =  192 
days,  P  =  2.4xl0'5),  indicating  that  Palb2  loss  significantly  accelerates  tumor  formation 


on  a  Trp53  conditional  null  background  (Fig.  3A).  Most  tumors  in  this  double  conditional 
KO  group  appeared  between  130-200  days. 

Somatic  loss  of  one  Trp53  allele  displayed,  as  expected,  a  haploinsufficient  tumor 
suppressor  phenotype,  given  that  Palb2fl/fl-,Trp53fl/+  \K14-Cre  mice  developed  tumors 
significantly  faster  than  Palb2^1\ Trp53+/+;K  1 4-Cre  mice  (T50  =  225  and  420  days, 
respectively,  P  =  2.5x10'  ,  Fig.  S4C).  Palb2  loss  of  function  also  accelerated  tumor 
formation  on  a  Trp53  heterozygous  (fl/+)  background,  reiterating  the  synergistic 
interaction  of  these  two  genes  (Fig  3B). 

W/4-Oe-mediated  conditional  loss  of  Palb2  and  Trp53  predominantly  led  to 
tumor  formation  in  breast,  skin  and  oral  mucosa  (Fig.  3E),  as  previously  reported  for 
Brcal  and  Brca2 23, 25’ 26 .  While  most  of  the  tumors  found  in  Palb2++  ;Trp5  3fl/f!  ;K  1 4-Cre 
mice  were  breast  carcinomas,  Palb2ITrp53  compound  conditional  KO  mice  displayed  an 
expanded  spectrum  of  tissues  affected  by  tumors  (Fig.  3E),  suggesting  that  PALB2  tumor 
suppressor  activities  are  not  restricted  to  the  mammary  gland.  Mice  harboring  conditional 
alleles  for  Palb2  and  Trp53,  but  no  K1 4-Cre  transgene,  and  Palb2+/+;Trp53+/+;Kl 4-Cre 
mice  did  not  display  increased  tumor  susceptibility  during  the  observation  period,  despite 
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the  intrinsic  mutagenic  activity  of  Cre  in  mammalian  cells  ’  .  Taken  together,  our 
results  imply  that  the  short  latency  and  high  penetrance  of  tumor  formation  observed  in 
Palb2^1;  Trp53^;K  1 4-Cre  mice  is  primarily  a  result  of  synergy  of  the  combined  loss  of 
Palb2  and  Trp53. 

In  order  to  assess  the  contribution  of  the  loss  of  Palb2  and/or  Trp53  alleles  as 
driver  mutations  in  the  tumors  that  arose  in  K1 4-Cre  mice,  we  determined  the  gene  copy 
number  for  Palb2  and  Trp53  by  qPCR  on  tumor  genomic  DNA.  All  analyzed  tumors 
from  Palb2^/fl\  Trp53^l;K 1 4-Cre  mice  (12  out  of  12)  lost  both  floxed  copies  of  Palb2  and 
Trp53  (Fig.  3D).  Similarly,  in  all  tumors  from  Palb2tI/t\  Trp53fl/+ ; K 1 4-Cre  mice  (15  of  15) 
only  residual  signals  could  be  detected  for  Palb2  and  Trp53  allele  copy  number, 
suggesting  that  the  conditional  Palb2  and  Trp53  alleles  were  fully  recombined  in  most 
cells,  whereas  the  WT  copy  of  Trp53  was  lost  through  LOH.  On  the  other  hand,  tumors 
that  arose  in  Palb 2^/+  mice  displayed  a  heterogeneous  pattern  of  loss  of  the  conditional 
and  WT  Palb2  or  Trp53  allele.  Early  reports  describing  lack  of  PALB2  LOH  in  clinical 
tumor  samples  from  heterozygous  patients  suggested  that  PALB2  could  be  a 


haploinsufficient  tumor  suppressor  in  humans.  Other  reports  showed  that  multiple  PALB2 
tumors  revealed  PALB2  LOH,  implying  that  the  PALB2  tumor  formation  process  is  not 
rigidly  uniform  ’  ’  .  In  our  experimental  setting,  no  haploinsufficiency  was  observed 
for  Palb2,  as  indicated  by  the  comparable  latency  in  Palb2II/+  ;Trp5  3/!/i\k  1 4-Cre  and 
Palb2+/+;Trp53^l;K  1 4-Cre  tumor  development  (P  =  0.46,  Fig.  S4A).  The  same  held  true 
when  similar  cohorts  of  Palb2+/+  and  Palb2ll/+  mice  were  compared  on  a  Trp53>v+\K14- 
Cre  background  (P  =  0.96,  Fig  S4B).  Likewise,  Palb2  heterozygous  mouse  breast  tumor 
lines  displayed  proper  RAD5 1  localization  at  IRIF,  consistent  with  preserved  HR 
function.  Tumor  formation  in  this  setting  was  not  accelerated  when  compared  with 
Trp53~/~  tumor  formation  (Fig.  4C).  As  expected,  Pa/62-deficient  breast  tumor  cell  lines 
displayed  the  same  defect  in  RAD5 1  accumulation  observed  in  Palb2- deficient  primary 
cells  (Fig.  4C).  This  again  points  to  a  role  for  HR  deficiency  in  the  genesis  of  these 
tumors.  Finally,  all  analyzed  breast  tumors  from  K14-Cre  mice  were  negative  for 
estrogen  receptor  (ER),  progesterone  receptor  (PR)  and  ErbB2  (HER2),  irrespective  of 
the  germline  genotype  (Fig  3F-H).  This  is  consistent  with  what  has  been  previously 
described  for  mouse  breast  tumors  arising  from  K14-Cre- mediated  conditional  deletion  of 
tumor  suppressor  genes  ’  ’  . 

Long  latency  tumors  were  also  observed  in  Palb2fhfl;K  1 4-Cre  mice  on  a  Trp53 
WT  background  (T50  =  420  days),  but  the  tumor  formation  is  still  highly  significant 
when  compared  with  Palb2+/+;Trp53+/+;Kl 4-Cre  control  mice  (P  =  5.4xlO'10)  (Fig.  3C). 
The  majority  of  these  tumors  were  small  lesions  in  the  head/neck  cavities,  whereas  a 
smaller  subset  of  animals  also  developed  mammary  tumors.  The  finding  that  loss  of 
Palb2  alone  is  sufficient  to  induce  tumor  formation  contrasts  with  most  Brcal  and  Brca2 
mouse  models  in  which  short-medium  latency  breast  tumor  formation  due  to  somatic  loss 
of  Brcal /2  cannot  be  detected,  unless  Trp53  is  also  co-deleted  ’  ’  ’  .  One  possible 
explanation  for  this  finding  is  that  somatic  Palb2  loss  might  be  better  tolerated  than 
somatic  Brcal/2  loss  on  a  Trp53  WT  background,  which  would  be  in  keeping  with  the 
observation  that  Palb2  deficiency  leads  to  a  less  severe  lethality  phenotype  in  ES  cells 
compared  to  Brcal /Brca2  deficiency  in  the  same  setting.  These  Palb2- null  and  HR- 
deficient  cells  would  survive  and  accumulate  additional  mutations  that  would  eventually 
lead  to  tumor  formation.  The  majority  of  the  macroscopically  dissectible  tumors  that 


arose  in  Palb2^}\  Trp53 +/+  ;K  14-Cre  mice  had  recombined  both  copies  of  Palb2.  Two  out 
of  7  mammary  tumors  gave  rise  to  tumor  cell  lines.  These  tumors  harbored  inactivating 
Trp53  mutations  (not  shown),  again  suggesting  strongly  that  loss  of  p53  is  ultimately 
required  for  Palb2  loss-mediated  tumor  formation. 

Genomic  features  of  Palb  2 1  Trp  5  3 -deficient  mammary  tumors 

Genomic  instability  is  a  hallmark  of  human  cancer,  and  it  is  believed  to  promote 
tumor  initiation  and  progression.  Experimental  mouse  tumor  models  have  recapitulated 
this  aspect  of  human  tumorigenesis.  In  addition,  aberrant  tumor  genomic  profiles  in  such 
models  have  been  shown  to  correlate  with  clinico-pathological  features  of  the  relevant 
tumor  as  well  as  with  the  results  of  certain  signal  transduction  events.  In  order  to  gain 
insight  into  the  genomic  nature  of  the  tumors  that  arose  due  to  the  loss  of  Palb2,  we 
performed  high-resolution  comparative  genomic  hybridization  (CGH)  analysis  of 
Palb2/T rp53,  Brcal/Trp53,  Brca2/Trp53  and  Trp53  only-deficient  mammary  tumors. 

Segmentation  analysis  of  the  CGH  data  was  performed  for  each  tumor  in  order  to 
assess  the  number  of  genomic  segments  with  deviating  copy  number  changes  as  a  readout 
of  genomic  instability33.  The  mammary  tumors  arising  in  Palb2iu,,\  Ttp53>V)]\K  1 4-  Cre 
mice  (n=8)  displayed  higher  numbers  of  amplified  segments  of  the  genome  (log2  dose  > 
0.5)  than  Brca2^!; Trp53^;K 1 4-Cre  tumors  (n=5;  p  =  0.046,  Fig.  S5).  When  compared  to 
Brcal/Trp53  tumors  (n=5),  which  are  known  to  display  high  levels  of  genomic 
instability,  the  CGH  profiles  of  Palb2ITrp53  tumors  were  equally  fragmented  (p  =  0.94), 
indicating  that  Palb2  loss  also  leads  to  formation  of  tumors  that  display  a  high  degree  of 
genomic  instability.  On  the  other  hand,  the  number  of  homozygous  deletions  (log2  dose  < 
0.5)  detected  in  Palb2/Trp53  tumors  was  significantly  lower  than  that  observed  in 
Brcal/Trp53  tumors  (p  =  0.035;  Fig.  S5),  whereas  Palb2/Trp53  tumors  were 
indistinguishable  from  Brca2/Trp53  tumors  in  that  regard  (p  =  0.45).  Based  on  these 
analyses,  Palb2ITrp53  tumors  seem  to  be  phenotypically  closer  to  Brcal/Trp53  driven 
tumors  in  terms  of  genomic  instability,  but  still  distinct  from  tumors  generated  by  either 
Brcal/Trp53  or  Brca2!Trp53  loss.  The  implication  for  the  trends  observed  is  that  PAFB2 
might  possess  biological  roles  that  are,  at  least  in  part,  not  overlapping  with  its  role  in  the 


regulation  of  BRCA2  function.  Notably,  we  were  unable  to  identify  chromosomal  regions 
of  imbalance  that  were  unique  to  Palb2ITrp53  tumors,  but  it  is  possible  that  a  more 
comprehensive  analysis  with  a  larger  collection  of  tumor  samples  might  allow  the 
identification  of  such  regions. 


53BP1  loss  fails  to  rescue  the  HR  defect  caused  by  PALB2  deficiency 

Loss  of  53BP1  can  rescue  an  HR  defect  observed  in  BRCA1- null  cells  and  the 
lethality  of  Brea  l'1'  mice34’35  (REF  36,  PMID:  22445484).  Similarly,  decreased 
expression  of  53BP1  expression  was  linked  to  triple  negative  breast  cancers  as  well  as  to 
human  BRCA1  tumors  .  We  then  asked  whether  Trp53bpl  is  absent  from  Palb2/Trp53 
KO  tumors,  and  whether  its  absence  would  rescue  the  HR  defect  associated  with  Palb2 
loss.  Quantitative  RT-PCR  analysis  of  Trp53bpl  mRNA  in  freshly  isolated  Palb2  tumor 
samples  showed  that  Trp53bpl  mRNA  levels  were  not  significantly  downregulated  in 
Palb2- deficient  breast  tumors  in  comparison  with  Palb2  WT  tumor  controls,  although 
some  tumors  displayed  reduced  Trp53bpl  mRNA  levels  (Fig.  4A). 

To  determine  whether  HR  deficiency  due  to  Palb2  loss  could  be  complemented 
by  Trp53bpl  loss  in  primary  cells,  we  generated  PaXb2!V)l\ CD  1 9-Cre  mice  that  were  or 
were  not  deficient  in  Trp53bpl.  Cultured  primary  splenocytes  from  these  mice  were  then 
assayed  for  their  HR  competence  upon  treatment  with  PARP1  inhibitors  (PARPi),  which 
selectively  induces  DNA  damage  and  chromosomal  aberrations  in  HR-deficient  cells  . 
Treatment  with  Olaparib  (PARPi)  induced  accumulation  of  chromosomal  and  chromatid 
breaks,  and  radial  structures  in  chromosomal  spreads  from  cultured  Palb2IJ/II\  Trp53bpl+/~ 
;  CD19-Cre  primary  splenocytes.  The  number  of  chromosomal  aberrations  was 
comparable  in  Palb2lU,\  Trp53bpPA;  CD19-Cre  splenocytes,  implying  that  Trp53bpl 
deletion  did  not  complement  the  HR  defect  caused  by  Palb2  deficiency  (Fig.  S6). 
Trp53bpl  deletion  also  failed  to  rescue  the  chromosomal  aberrations  found  in  spreads 
from  PARPi-treated  Brca2fl/fl\  Trp53bpP/';  CD1 9-Cre  splenocytes.  In  the  same 
experiment,  complete  rescue  of  the  DNA  repair  deficiency  in  Brea  l^1;  Trp53bpP/~ ; 
CD19-Cre  splenocytes  was  observed  as  previously  described  ’  .  These  observations 
suggest  that  the  contributions  of  PALB2  and  BRCA2  to  HR-based  DSB  repair  are 


distinct  from  those  of  BRCA1,  and  cannot  be  complemented  by  53BP1  loss.  In  keeping 
with  existing  evidence,  PALB2  and  BRCA2  may  well  be  de  facto  downstream  HR 
effectors,  which  cannot  be  replaced  or  bypassed,  except  by  artificially  forcing  the  loading 
of  RAD51  onto  chromatin  at/near  DSB  (REF  36, 38'40) .  These  observations,  along  with 
earlier  results  (PMID:  20871615),  also  suggest  that  PARP1  inhibition  is  apotential 
therapeutic  target  in  PALB2- deficient  tumors,  to  the  same  extent  that  it  is  in  BRCA1-  and 
BRCA2- associated  tumors  . 


CONCLUSIONS 

In  summary,  we  have  shown  that  Palb2  is  a  breast  tumor  suppressor  gene  in  mice 
as  it  is  in  humans,  and  synergizes  with  Trp53  to  suppress  tumor  formation.  However, 
tumorigenesis  driven  by  Palb2  loss  in  the  mouse  is  not  entirely  suppressed  on  a  Trp53 
WT  germline  background,  unlike  most  Brcal  and  Brca2  mouse  models  of  breast 
tumorigenesis  (PMID:  16998503).  Despite  some  similarities  to  Brca2;Trp53;K14-Cre 
breast  tumors,  Palb2  tumors  still  display  certain  divergent  features  that,  together,  support 
the  notion  that  PALB2  possesses  additional  biological  functions  that  might  not  be  related 
to  HR  and/or  its  interaction  with  BRCA2,  as  judged  by  the  divergent  chromosomal 
aberration  pattern  observed  in  between  Palb2-  and  Brca2- derived  tumors  and  for  the  fact 
that  Palb2fl/fl ;K14-Cre  conditional  mice  which  are  Trp53  WT  still  develop  tumors  with 
long  latency,  unlike  Brca2f],fl\K14-Cre  mice.  Finally,  haploinsufficiency  for  Palb2  tumor 
suppression  was  not  detected  in  this  model,  although  one  cannot  rule  out  that  it  would  be 
manifest  in  a  different  model  system  and/or  with  enlarged  cohorts  of  experimental  mice. 
For  example,  the  tumors  in  this  mouse  model  driven  by  K14-Cre  were  uniformly  of  the 
triple  negative  phenotype.  This  difference  might  well  contribute  to  the  absence  of 
haploinsufficiency  in  our  system,  in  the  same  fashion  that  distinct  cell  populations  in  the 
human  mammary  gland  display  preferential  patterns  of  consecutive  LOH  events  along 
the  tumorigenesis  pathway  associated  with  BRCA1  loss41.  We  believe  that  this  mouse 
model  will  be  useful  in  further  studies  aimed  at  unraveling  the  molecular  pathways  in 
which  PALB2  plays  a  role. 


Figure  legends 

Figure  1.  Conditional  gene  targeting  of  mouse  Palb2.  A)  Schematic  representation  of 
Palb2  domains  and  the  exons  from  which  they  are  encoded.  The  shaded  area  corresponds 
to  the  truncated  open  reading  frame  encoded  upon  recombination  of  the  inserted  loxP 
recombination  sites.  B)  Western  blot  analysis  for  PALB2  in  chromatin  fraction  extracts 
(S420)  of  three  independent  ES  cell  lines.  The  full  length  mouse  PALB2  protein  is  ~120 
kDa.  An  unspecific  background  band  is  indicated  by  an  asterisk,  and  can  be  used  as  an 
internal  loading  control.  C)  Recruitment  of  RAD51  to  DSBs  marked  by  yH2AX  IRIF  2h 
after  exposure  to  5  Gy  of  ionizing  radiation  in  Palb2fl/fl  and  Palb2~-  ES  cells.  D)  Western 
blot  analysis  of  chromatin-bound  (S420)  RAD5 1  in  Palb2  l!/l!  and  Palb2~-  ES  cells  that 
received  10  Gy  of  IR  and  their  respective  unirradiated  control.  Histone  H3  was  used  as  a 
loading  control.  E)  Dose-response  curves  of  Palb2,l/fI  and  Palb2--  ES  cells  after  exposure 
to  increasing  concentrations  of  the  radiomimetic  neocarzinostatin. 

Figure  2.  Early  lethality  of  Palb2~-  embryos.  A)  Bright- field  images  of  Palb2+I+  and 
Palb2~l~  embryos  at  E10.5.  Arrow  indicates  brain  exencephaly.  B)  Graph  indicating  the  p- 
values  of  Hardy-Weinberg  disequilibrium  for  Palb2  KO  embryos  retrieved  from  Palb2+/~ 
matings  at  various  embryonic  ages.  Both  Cdknla  (p21 )-  or  Cdkn2a  (pl6INK4A)- deficient 
backgrounds  delay  the  lethality  of  Palb2- deficient  embryos  and  the  time  point  at  which 
the  counterselection  of  Palb2  KO  embryos  become  significant  (p  <0.05).  C-H) 
Hematoxylin-eosin  staining  of  placenta  (C,  F)  and  yolk  sac  blood  islets  of  WT  (C-E)  or 
Palb2'/~  embryos  (F-H).  Pictures  E  and  H  display  higher  magnification  of  the  areas 
delimited  on  D  and  G.  I-J)  Hematoxylin-eosin  staining  of  in  utero  embryo  sections  from 
tetraploid  complementation  assay  at  E9.5,  utilizing  Palb2II//I  ES  cells  (I)  or  Palb2~/~  ES 
cells  (J). 

Figure  3.  Tumor  formation  in  Palb2^/^ ;  Trp53^!;K14-Cre  mice.  A-C)  Kaplan-Meier 
curves  displaying  that  Palb2  loss  accelerates  tumor  formation  both  under  a  Trp53 
conditional  null  background  (A)  as  well  under  a  Trp53  conditional  heterozygous 
background  (B)  and  finally  on  a  Trp53  WT  background.  D)  Palb2  (top  panels)  and  Trp53 
(lower  panels)  gene  dosages  of  mammary  tumors  derived  from  Palb2/T rp53  double 


conditional  mouse  cohorts.  The  germline  Palb2  and  Trp53  genotypes  of  the  mice  are 
indicated  in  red  below  the  graphs.  E)  Spectrum  of  tumors  arising  in  mouse  cohorts  with 
different  combinations  of  Palb2  and  Trp53  alleles.  The  genotypes  of  the  mice  are  shown 
below  the  graphs.  F-H)  Mammary  tumors  arising  in  Palb2/Trp53  double  conditional 
mice  are  triple-negative.  Representative  IHC  stainings  for  ER  (F),  PR  (G)  and  HER2  (H). 
The  arrows  on  the  top  left  comer  of  F)  and  G)  indicates  an  in  situ  ductal  carcinoma 
structures  that  displays  the  typical  nuclear  positivity  for  ER  and  PR,  respectively. 

Figure  4.  The  HR  defect  in  Ra/hi-deficient  cells  and  tumors.  A)  RT-qPCR  for  Trp53bpl 
mRNA  in  freshly  isolated  tumor  samples  that  are  either  Ra/W-proficient  (+/+  and  +/-)  or 
Palb2- deficient  (-/-).  B)  Distinct  defects  observed  in  chromosome  spreads  of  acute 
chromosomal  damage  and  genome  instability  following  PARPi  treatment  are  not  rescued 
by  Trp53bpl  deletion  in  Palb2^1; CD  1 9- Cre  B-lymphocytes.  C)  Established 
Palb2/T rp53- deficient  breast  tumor  cell  lines  have  not  reversed  the  defect  of  recruitment 
of  RAD51  to  IRIF,  whereas  Palb2  heterozygosity  does  not  impair  the  proper  IRIF 
localization  of  RAD51  in  breast  tumor  lines,  compared  with  Palb2  WT  control  breast 
tumor  lines. 


Figure  SI.  A)  Schematic  representation  of  the  targeting  strategy.  The  wild-type  allele 
(WT),  targeting  construct  and  targeted  allele  are  depicted.  Dashed  lines  indicate  regions 
of  homology.  Black  triangles  represent  loxP  sites,  and  arrows  represents  the  primers  used 
for  RT-PCR.  B)  Southern  blot  of  genomic  DNA  from  targeted  ES  cell  clones  digested 
with  indicated  restriction  enzyme  (BamHI  or  BsaBI),  and  hybridized  with  5'  and  3'  probe. 
Note  that  only  some  of  the  targeted  clones  (3'  probe)  that  show  cointegration  of  the 
second  loxP  site  with  the  5'  probe  were  used.  C)  Real-time  RT-PCR  analysis  of  Palb2 
mRNA  levels  in  three  ES  cell  lines  of  the  indicated  genotypes  derived  from  heterozygous 
Palbf A  crosses. 

Figure  S2.  Response  of  Palb2~/~  cells  to  DNA  damage.  A)  Recruitment  of  BRCA1  to 
DSBs  marked  by  yH2AX  foci  2h  after  exposure  to  5  Gy  of  ionizing  radiation  both  in 
Palb2^]  and  Palb2~A  cells.  B-C)  Survival  curves  of  Palb2?l/fl  and  Palb2'/~  ES  cell  lines  to 
DNA  damaging  agents  such  as  mitomycin  C  (B)  and  IR  (C). 


Figure  S3.  Palb2  WT  placentas  do  not  rescue  the  lethality  of  Palb2~/~  embryos. 
Hematoxylin-eosin  section  staining  of  embryos  from  tetraploid  complementation  assay. 
E12.5  derived  from  WT  tetraploid  embryo  aggregation  with  either  Palb2p/fl  ES  cells  (left) 
or  Palb2~/  ES  cells  (right).  By  this  stage  (E12.5),  all  Palb2~f~  embryos  have  been 
reabsorbed  (arrows). 

Figure  S4.  Effects  of  Palb2  and  Trp53  heterozygosity  in  K14-Cre  mice.  Kaplan-Meier 
curves  of  tumor- free  survival  indicating  that  Palb2  is  not  haploinsufficient  for  tumor 
formation  either  on  a  Trp53  conditional  null  (A)  or  conditional  heterozygous  background 
(B).  On  the  other  hand,  Trp53  displays  a  strong  haploinsufficient  tumor  phenotype  on  a 
Palb background  (C). 

Figure  S5.  CGH  analysis  of  Palb2  tumors.  A)  Control  (spleen)  and  tumor  DNA  was 
hybridized  to  whole  genome  arrays  to  determine  regions  of  loss  or  gains  in  mouse  breast 
tumor  samples.  Representative  rainbow  graphs  for  each  tumor  genotype  showing  log2 
DNA  copy  number  ratio  (tumor/spleen)  across  the  entire  genome.  (B-C)  Dot  plots  of  the 
number  and  dose  ratio  of  high-level  amplification  (log2  >  0.5,  B)  or  homozygous  deletion 
events  (log2  <  -0.5,  C).  Each  dot  represents  an  amplification/deletion  event,  with  its 
corresponding  relative  dose.  Asterisks  indicate  differences  that  were  statistically 
significant  (p  <  0.05). 

Figure  S6.  Rescue  of  the  PARPi-induced  acute  chromosomal  damage  in  Brcal/2- 
deficient  B-lymphocytes  by  Trp53bpl  deletion.  Trp53bpl  deletion  is  able  to  rescue 
defects  observed  in  chromosome  spreads  from  PARPi-treated  Brcal- deficient  B- 
lymphocytes  ( Brcalfl/fl ;CD19-Cre )  (A),  but  cannot  rescue  the  damage  in  5rca2-deficient 
B-lymphocytes  (B). 
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